Sitemap

Why Most AI Agents Won’t Run on Your Warehouse

10 min readJul 21, 2025

There’s a gold rush happening in enterprise tech. Every company wants in on the AI agent revolution. Intelligent software that doesn’t just report on the business, but runs it? Sounds amazing.

And the pitch sounds irresistible: “You already have all the data. It’s in the warehouse. It’s your single source of truth. That’s all you need to power AI agents.”

This framing is everywhere. The gravitational pull of the data warehouse, decades of tooling, investment, and executive familiarity, makes it feel like the obvious foundation for intelligent systems. But it’s not.

The majority of business AI agents are not analysts.

They’re not waiting to generate a quarterly report. They’re real-time decision-makers, embedded in the flow of business, reacting to customer chats, changes in inventory, or moments of buyer intent as they happen. And for that, the data warehouse is the wrong tool for the job.

AI apps powered by a warehouse are the wrong mental model. We’re trying to power a nervous system with an archive. We’re expecting a library to run a factory. We’re designing self-driving cars that stop to ask a librarian for directions.

This post is about why that thinking persists, why it fails, and how to escape it. We’ll explore real-world examples where the warehouse model breaks down, and what history teaches us about misapplying old paradigms to new technologies. You’re not building an analyst with a chat interface. You’re building software that acts. And that means ditching the warehouse as your foundation and wiring agents into the live pulse of your business.

The Tyranny of the Warehouse Mindset

For years, the prevailing architecture was simple: collect everything, clean it, centralize it. Then analyze it. That model powered the rise of Business Intelligence, executive dashboards, and a generation of data-driven strategy. The warehouse became the “brain” of the enterprise, where questions were answered and reports were born.

Press enter or click to view image in full size
Standard ETL/ELT Process to Build Business Intelligence
Standard ETL/ELT Process to Build Business Intelligence

But success breeds inertia. And today, many organizations are trying to extend that same model into the world of AI agents.

The mental trap goes something like this: We already have all the data. It’s in the warehouse. There’s lots of compute there as well. Let’s just point the AI at it.

That mindset fundamentally misunderstands how most AI agents work.

AI agents are not just smarter dashboards. They’re software applications that are stateful, reactive, and operational. They should monitor business events, make decisions, and trigger actions. They have memory. They call APIs. They need uptime, low latency, and observability. That’s not a data problem. It’s an platform engineering problem. And it belongs to the part of the organization best equipped to solve it: the platform and application teams.

Treating agents like analytical tools and handing them off to the data team misses the mark entirely. Building an AI agent is not about training models from scratch. It’s typically about composing business logic on top of powerful pre-existing models. That’s software engineering work, and it needs to be deployed, maintained, monitored, and secured like any other production application.

That’s why agents must live in the operational estate of the business, close to the systems they’re meant to observe, the APIs they need to call, and the data streams they depend on to make timely decisions. The same concerns that apply to your app stack: latency, scaling, disaster recovery, access control, apply to your agents.

Warehouses are libraries. Great for research, useless in a fire.

Agents Need a Nervous System, Not Just a Memory Vault

For years, we’ve talked about the data warehouse, or more recently, the data lakehouse, as the “brain” of the enterprise. It’s a tidy analogy. After all, the warehouse can theoretically store everything, it’s centralized, and it feeds decision-making. But as a metaphor for intelligence, it falls apart.

Warehouses, and databases more broadly, don’t think. They don’t perceive. They don’t act. They sit quietly, waiting to be queried. They don’t initiate anything. If you want information, you have to know what you’re looking for, formulate a question, and pull the answer out manually or via a BI tool. That’s not a brain. That’s a memory vault.

Ironically, large language models (LLMs) are much closer to what we’ve been calling a “brain” all these years. They can reason, summarize, infer, and respond flexibly to new prompts. But they have their own shortcoming: they don’t know anything about your business unless you tell them. They don’t know who your customers are, what just happened in your logistics system, or which support ticket is about to escalate, unless they’re connected to live, trusted, structured context.

Press enter or click to view image in full size
Prompt Assembly Process for Giving a Model Task-Specific Data
Prompt Assembly Process for Giving a Model Task-Specific Data

That’s where agents come in. An AI agent is not just a model, it’s a living full body system. It can perceive, decide, and act.

Press enter or click to view image in full size
The Full Body System of an AI Agent
The Full Body System of an AI Agent

And to do this effectively, it needs access to real-time data (to perceive), memory and logic (to decide), and tool interfaces (to act). That means plugging the agent into a live, operational environment, not a passive archive.

This is the real architectural divide:

Press enter or click to view image in full size
Warehouse Versus Agent Thinking
Warehouse Versus Agent Thinking

Would you build a self-driving car that queries the DMV to know if the light just turned red?

Of course not. But that’s exactly what you’re doing when you try to power an AI agent from a warehouse. The warehouse might have the vehicle’s registration history, service records, and driving habits from last month. But it doesn’t know that the light just changed, or that a pedestrian just stepped into the crosswalk. It knows the past, it has no perception of the present.

A better metaphor for what agents need is the nervous system.

Just like humans, agents need to integrate signals from across the body (the business) in real time; what’s happening now in support, logistics, CRM, or finance, and coordinate a response. And just like humans, agents also benefit from memory: knowing past interactions, past issues, past behavior. But memory alone isn’t enough. You need live input to make live decisions.

The warehouse may tell you what happened. But it can’t tell you what’s happening. And it certainly can’t help your agents act in the moment.

Agents Belong in the Operational Estate

Consistently when companies decide to invest in AI agents, I see the work land with the data team. It seems logical: they “own the data,” after all. But for most types of agents, this is a category error.

Most AI agents aren’t analytical projects. They’re operational software.

  • The Data Estate is built for history: batch jobs, BI tools, and offline models.
  • The Operational Estate is built for now: live systems, streaming data, real-time APIs.

They have different owners, different cadences, and even different definitions of data quality. To the data team, quality means accuracy and consistency. To a platform team, it likely means timeliness and availability.

AI agents need to observe business events as they happen, maintain short-term state, and trigger immediate actions. That means they must live in the operational estate, close to the data, close to the tools, and managed by the platform teams who build and run production systems.

Where the Warehouse Fails: Three Agents, Three Broken Promises

Let’s take some real world use cases. It’s one thing to say the warehouse is the wrong foundation. It’s another to watch what happens when you try to build agents on top of it.

Here are three stories of agents that could have worked but didn’t.

The Customer Support Agent Who Was Always Behind

A customer starts a chat: “Where’s my order?”

The agent checks the data warehouse. It pulls last night’s snapshot. According to the batch, the package was processed. Great. But the customer already knows that. What they don’t know is that their package went out for delivery 20 minutes ago, something only the fulfillment system knows, in real time.

Worse: the customer is frustrated. Their tone is escalating. The agent should route the chat to a human. But the warehouse doesn’t do live sentiment tracking. It can’t see the rising tension. By the time the handoff happens, if it happens, the customer has already churned.

The agent had the data. It just didn’t have the right data. Not live, not actionable, not in time.

The Sales Agent Who Missed the Moment

A prospect lands on your pricing page. High-intent behavior. A good agent would notice, check the CRM history, and trigger a personalized outreach within minutes.

But the signal doesn’t flow through the sales agent. It flows into Kafka, then into a batch job, then into a warehouse. By the time it’s aggregated and processed, it’s tomorrow. The window is closed.

Meanwhile, the agent is still working off last week’s activity report: “Maybe follow up with Company X?” Too late. Jane from Company X sent an email three hours ago titled “Urgent question on pricing.” No one replied. She’s moved on.

The Livestream Assistant Who Was Watching the Replay

The livestream is live. Thousands of viewers are pouring into the chat. Questions scroll by. Comments pile up. Someone asks the same thing ten times. Another drops an account-specific issue. A third leaves entirely after not getting a response.

There’s an agent watching (kind of). It’s connected to a warehouse that’s loading chat logs in 10-minute intervals.

It can summarize the most common questions, after the event ends. It can analyze sentiment, after viewers are already gone. It can’t recommend follow-ups to the speaker or highlight VIP participants in the moment.

It’s like hiring an event assistant who only reads the transcript after the show.

Same Pattern, Same Problem

Each of these agents failed in different ways, but for the same reason: they were plugged into a memory vault instead of a nervous system. They weren’t built to perceive. They weren’t built to react. They were built like BI dashboards with a chatbot glued on.

It’s the same reason you don’t run a login system off a data warehouse. Or power a shopping cart with Snowflake. Or check someone’s password against an analytics backend. Warehouses are built for reflection, great for understanding what happened, but blind to what’s happening now.

But most agents aren’t reports. They’re real-time, stateful, production-grade applications. And like any other app that matters, they need to live close to the action: fast infrastructure, low-latency APIs, and always-on event streams.

Trying to build agents on a warehouse is like trying to run Uber off a census. Accurate, comprehensive and completely useless in the moment that matters.

Historical Parallels: Misapplying the Old to the New

This isn’t the first time we’ve misunderstood a new paradigm by trying to squeeze it into an old mental model. History is full of examples where we misapplied what worked in the past to what was emerging and paid the price.

Waterfall vs. Agile

For decades, the Waterfall model dominated software development. It was orderly and linear: gather requirements, design, build, test, release. No changes. No surprises.

As software became more complex, and user needs evolved faster, teams discovered that Waterfall’s rigidity was a liability. Requirements changed mid-project. Features needed iteration. Feedback loops had to shrink. Enter Agile: a new way of working that embraced change, encouraged iteration, and thrived on real-time feedback.

Press enter or click to view image in full size
Waterfall vs Agile Software Methodology
Waterfall vs Agile Software Methodology

Trying to build AI agents on a warehouse is a Waterfall-style mistake. It assumes that data can be prepared ahead of time, decisions can be made in sequence, and actions can follow after the fact. But agents are Agile by nature. They need continuous input, short decision cycles, and the ability to adapt on the fly.

Broadcast TV vs. Interactive Media

For decades, broadcast TV ruled. It was linear, scheduled, and the same for everyone. Content was created in advance and delivered at a fixed time to a passive audience.

Then the Internet changed everything. Streaming, personalization, and real-time interactivity became the new norm. Viewers became participants. Content adapted to context.

Press enter or click to view image in full size
Source: Netflix Is Now Bigger Than Cable TV
Source: Netflix Is Now Bigger Than Cable TV

That’s the shift agents represent. Warehouses are built for broadcast: predefined reports delivered on a schedule. But agents are interactive systems. They respond to users in real time, based on live data and dynamic context.

Trying to power agents with a warehouse is like trying to run Netflix on a VCR. Wrong format, wrong timing, wrong foundation.

Don’t Build a Better Library. Build a Live System.

If there’s one takeaway from the shift to agentic AI, it’s this: you don’t need a smarter archive, you need a system that can sense, decide, and act in real time.

Agents are rarely one-off scripts or offline tools.

They’re long-running, event-driven applications. They observe live signals, maintain state, invoke tools, and adapt to context. And that means they need the same kind of infrastructure you’d use to power any critical production system:

  • Low-latency runtimes
  • Real-time data streams
  • Secure, responsive APIs
  • Monitoring, deployment, and recovery pipelines

To support this, organizations need to realign ownership and responsibility:

  • The Data Team becomes a provider of governed, high-quality historical data for model training, enrichment, and analytical context.
  • The Platform Team owns the real-time backbone: stream processors, operational databases, execution environments, observability, and identity.
  • The Application Team builds the agents themselves, crafting business logic, integrating tools, and delivering real-time experiences.

The data warehouse still plays an essential role. It’s your long-term memory, your source of strategic insight, your offline training ground. But it’s no longer the operational command center. Unless you’re building an analyst agent, your AI shouldn’t be running inside the warehouse compute environment or making decisions based on yesterday’s data.

Agents don’t live in static systems. They live in motion. And if you want them to be intelligent, you have to put them where the signals are.

The Mindset Shift That Makes or Breaks AI Success

The companies that win with AI won’t be the ones that glue a chatbot onto a BI dashboard. They’ll be the ones that embed agents directly into the operational fabric of the business, sensing, deciding, acting, and improving in real time. You need to choose the right terrain for the system you want to build.

So here’s the call to action: Audit where your agents live today.

If they’re still trapped in the library, it’s time to rethink your strategy.

--

--

Sean Falconer
Sean Falconer

Written by Sean Falconer

AI @ Confluent | 100% Canadian 🇨🇦 | Snowflake Data Superhero ❄️ | AWS Community Builder

Responses (2)