How business users query data without SQL using Databricks Genie

Written by

Arvind Divakar

Data Engineering

Why self-service analytics still feels out of reach

There is a pattern I have seen in almost every data team I have worked with. Everyone talks about self-service analytics, but when you look closely, most business users are still dependent on analysts for even simple questions.

Someone from operations wants to know what changed last week. They send a message. An analyst picks it up, writes a query, validates the numbers, and sends back a result. Then comes a follow-up question, and the cycle repeats.

Dashboards help, but only to a point. They answer predefined questions. The moment someone asks something slightly different, you are back to writing SQL, and the bottleneck does not go away. It just shifts.

What tools like Databricks Genie are trying to do is remove that dependency altogether, or at least reduce it significantly.

What Genie is really doing behind the scenes

When people first hear about Genie, the assumption is usually that it is just a chat interface for your data. That is not quite accurate, and the distinction matters.

Genie allows business users to ask questions in plain language and get answers directly from enterprise data, but it does not work on raw data in isolation. The business logic is already given to Genie before a user asks a single question. Definitions like what "on-time delivery" means, how "transit time" is calculated, and what counts as a "delayed shipment" are all configured in advance. Genie is not guessing. It is operating within a defined context.

So when someone asks a question, it is not generating a random SQL query. It is translating that question into something that respects the way the business actually measures things. From the user's perspective it feels simple. Behind the scenes, there is a lot of structure making that possible.

How we approached it in a real-world POC

We explored this by building a Genie-based proof of concept around operational data. The goal was not to test whether the technology works in isolation, but whether it actually changes how business users interact with data.

We used a logistics dataset covering shipment volumes, transit times, delivery performance, and exceptions. The kind of data that usually sits behind multiple dashboards and ad hoc queries. The setup connected our existing warehouse to Databricks through Unity Catalog, with Genie layered on top alongside a basic dashboard interface so users could either look at predefined views or ask questions directly.

How users naturally start interacting with data

When we let business users try it, the biggest shift was not capability. It was behavior.

Instead of thinking in terms of tables or columns, users asked things like "What happened to delivery performance last week?" or "Show me delays by route." The system responded with actual numbers, not approximations, and it worked reliably because the business definitions were already in place. Genie was not trying to interpret vague terms from scratch. It was using the logic we had already defined.

Structuring data so Genie actually works

A significant part of making this work is how Genie is structured internally. You do not point it at your entire data warehouse and expect it to figure things out. You define spaces tied to curated datasets, each with a specific purpose, a defined set of tables, and controlled access. This keeps things clean and predictable.

Within each space, you configure what Genie should understand. You define metrics, describe how different datasets relate to each other, and give it the language of the business. This is where the real work happens. Once that layer is in place, the experience for the user becomes straightforward. They do not need to know anything about schemas or joins. They ask questions the way they normally would, and they get answers that reflect how the business actually works.

How Genie handles unclear questions

One thing I found particularly useful is how Genie handles loosely framed questions. In traditional setups, that kind of question would either fail or return something misleading. With Genie, when proper business descriptions are in place, it can recognize when a term needs clarification and guide the user toward a more precise question. That becomes a strength rather than a limitation. Instead of silently returning the wrong answer, it keeps the interaction grounded in business context and ultimately leads to better decisions.

Where this fits in day-to-day analytics

It is worth being clear about where Genie fits and where it does not. If someone wants to explore data, compare trends, or investigate what changed and why, Genie is highly effective. It removes the need to translate business questions into SQL. But if you are tracking fixed KPIs that need to be monitored consistently, dashboards still play an important role. They provide consistency and speed.

In practice the two work best together. Dashboards give you a stable view of the business. Genie gives you flexibility when you need to go beyond that.

What changed with the introduction of agent mode

One of the more significant recent updates to Genie is the introduction of agent mode, which changes how it handles more complex questions. Earlier, most interactions were single-step: you ask a question and get an answer. Now Genie can break a question into multiple steps, reason through intermediate results, and build toward a final answer. It also carries context across follow-up questions more effectively.

This becomes especially useful when users are not just asking for numbers but trying to understand why something happened. You start to see it behave less like a query interface and more like an analytical assistant.

Why context matters more than the tool itself

If you are thinking about exploring Genie, the most important thing to get right is not the tool. It is the context you provide to it. The better your business definitions, the clearer your metrics, and the cleaner your data relationships, the more reliable Genie becomes. Without that foundation, even a well-configured AI layer will not give you meaningful results.

With it, something useful starts to happen. Business users stop waiting for answers and start exploring data on their own. Analysts spend less time writing repetitive queries and more time building the foundation that makes all of it possible. That shift is modest to describe but significant in practice, and it is what makes this worth getting right.

How we help teams move faster

View our approach