Firebolt is built to power AI agents. Our Postgres-compatible analytical SQL engine efficiently handles agentic speculation: run sub-second queries under high concurrency at the lowest possible cost.
New research from UC Berkeley identifies four key characteristics of how AI agents interact with databases. They all revolve around agentic speculation: agents formulate hypotheses and query the underlying database to test them. Well-performing agentic systems often have many agents collaborating on a single problem, which means you can easily have hundreds of parallel agents asking similar questions. Firebolt was built for exactly these patterns.
AI agents generate massive volumes of parallel and sequential requests to solve complex tasks efficiently.
Firebolt can scale to support 4000+ QPS.
Many agents request access to similar data, which makes it possible to optimize queries through shared computation.
Cross-query optimization automatically shares computation across queries; a sketch of this access pattern appears below.
Agents perform better when they receive background information about the underlying data.
Firebolt supports rich metadata to annotate all the objects in your database.
Agentic queries range from coarse-grained initial exploration to complete attempts at solving the task.
Firebolt scales out and up to handle large data and query volumes. The query planner finds an efficient plan for every query.
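To make the pattern concrete, here is a minimal Python sketch of agentic speculation against the engine: many agents issuing near-identical probes in parallel over the Postgres wire protocol. The endpoint, credentials, and the `sales` schema are placeholders, not actual Firebolt connection parameters.

```python
# A minimal sketch of the agentic-speculation access pattern: many agents
# probing the warehouse with similar queries in parallel over the Postgres
# wire protocol. Host, credentials, and table/column names are placeholders.
from concurrent.futures import ThreadPoolExecutor

import psycopg2  # any Postgres-wire-protocol client should work

DSN = "host=<firebolt-endpoint> port=5432 dbname=analytics user=agent password=<password>"

def agent_probe(region: str) -> list[tuple]:
    """One agent's hypothesis test: a narrow aggregate over shared data."""
    conn = psycopg2.connect(DSN)  # one short-lived connection per probe; pooling omitted for brevity
    try:
        with conn.cursor() as cur:
            cur.execute(
                "SELECT product_id, SUM(revenue) AS rev "
                "FROM sales WHERE region = %s "
                "GROUP BY product_id ORDER BY rev DESC LIMIT 10",
                (region,),
            )
            return cur.fetchall()
    finally:
        conn.close()

# Hundreds of agents asking near-identical questions at once: the engine can
# share scans and computation across these concurrent probes.
regions = [f"region_{i}" for i in range(200)]
with ThreadPoolExecutor(max_workers=50) as pool:
    results = list(pool.map(agent_probe, regions))
```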
Firebolt integrates seamlessly with the tools and protocols that power today's AI applications.
Native PostgreSQL dialect and wire protocol support for seamless integration with existing AI tools and frameworks.
Built-in support for Apache Iceberg, enabling efficient data lakehouse architectures for AI workloads.
Model Context Protocol server for direct AI model integration and context sharing.
Agent-to-Agent communication protocol for sophisticated multi-agent AI systems.
Native support for LangChain and other popular AI frameworks for rapid development; a connection sketch appears below.
Support for embeddings and semi-structured data.
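Because the engine speaks the Postgres dialect and wire protocol, standard tooling can connect without custom drivers. Below is a minimal LangChain sketch; the connection URI, database name, and `sales` table are placeholders, so consult the Firebolt documentation for the actual connection parameters.

```python
# A minimal sketch of driving the engine from LangChain over the Postgres wire
# protocol. URI, credentials, and table names are placeholders.
from langchain_community.utilities import SQLDatabase

# SQLAlchemy-style URI using the Postgres dialect (placeholder credentials).
db = SQLDatabase.from_uri(
    "postgresql+psycopg2://agent:<password>@<firebolt-endpoint>:5432/analytics"
)

# Expose schema context to the model, then let the agent issue SQL probes.
print(db.get_usable_table_names())
print(db.run("SELECT count(*) FROM sales"))
```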
The fastest analytical query engine in the world, optimized for AI workload patterns.
High-volume data ingestion capabilities that keep pace with the fastest data generation rates.
Purpose-built architecture that handles thousands of concurrent AI agent queries without degradation.
Work on the freshest data available, ensuring AI agents make decisions on current information.
The foundational capabilities required to support autonomous AI agents in production environments:
Postgres-compatible SQL dialect to help agents build queries
Postgres-compatible wire protocol for integration with existing tools
Support for AI ecosystem protocols and frameworks - MCP, A2A, LangChain
Integration with LLM providers of the user's choice
Fits into the Python ecosystem: from inside via Python UDFs, from outside via pandas DataFrame-like APIs (see the pandas sketch below)
Hybrid search: a mix of symbolic queries (SQL) and semantic search (vector similarity); see the sketch below
Vector search indexes for approximate nearest neighbor (ANN) lookups, crucial for real-time agent-loop performance
A planner that can handle complex, machine-generated SQL
Approximate query results and sampling to serve agent probes
Agentic memory: agents need a place to persist beliefs, goals, plans, and past interactions, which calls for fast DML and explicit transactions (see the transaction sketch below)
Flexible schemas: agents deal with heterogeneous and evolving data (text, embeddings, structured facts), so the database must handle both structured (SQL tables) and unstructured (documents, vectors, JSON) data
Temporal/versioned data: built on copy-on-write and zero-copy-clone mechanisms, allowing low-cost branching and fast transaction rollback
Multi-agent state sharing: When multiple agents act on the same data, the database must handle concurrency, isolation, and conflict resolution
Task queues & workflows: Agents often need transactional coordination between steps
Provenance & lineage: Ability to track where knowledge came from
Access control: Fine-grained security (agents may have different capabilities/roles)
Audit logs: Required for debugging autonomous behavior
Low-latency queries: Since agent loops often run synchronously with user requests, retrieval must be sub-100ms
Horizontal scale: Agents can generate and consume large volumes of data quickly
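The hybrid-search item above can be sketched as follows: a symbolic SQL predicate narrows the candidate set, then a semantic rerank orders candidates by vector similarity. The `documents` table, its columns, and the `embed()` helper are hypothetical; in practice a server-side ANN index would replace the client-side rerank.

```python
# A minimal hybrid-search sketch under assumed names: a symbolic SQL filter
# prunes candidates, then a semantic rerank orders them by cosine similarity.
import numpy as np
import psycopg2

DSN = "host=<firebolt-endpoint> dbname=analytics user=agent password=<password>"

def embed(text: str) -> np.ndarray:
    """Hypothetical embedding helper; plug in your model or provider here."""
    raise NotImplementedError

def hybrid_search(question: str, category: str, k: int = 5) -> list[tuple[str, float]]:
    query_vec = embed(question)
    conn = psycopg2.connect(DSN)
    try:
        with conn.cursor() as cur:
            # Symbolic step: a structured predicate narrows the candidate set.
            cur.execute(
                "SELECT doc_id, embedding FROM documents WHERE category = %s",
                (category,),
            )
            rows = cur.fetchall()
    finally:
        conn.close()
    # Semantic step: rank candidates by cosine similarity to the query embedding.
    scored = []
    for doc_id, emb in rows:
        vec = np.asarray(emb, dtype=float)
        score = float(vec @ query_vec / (np.linalg.norm(vec) * np.linalg.norm(query_vec)))
        scored.append((doc_id, score))
    return sorted(scored, key=lambda x: x[1], reverse=True)[:k]
```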
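The agentic-memory item calls for explicit transactions. A minimal sketch, assuming illustrative `agent_memory` and `agent_steps` tables: the agent persists a belief and the step that produced it atomically, rolling back if either write fails.

```python
# A minimal agentic-memory sketch with an explicit transaction: both writes
# become visible together, or neither does. Table names are illustrative.
import json
import psycopg2

DSN = "host=<firebolt-endpoint> dbname=analytics user=agent password=<password>"

belief = {"goal": "find underperforming regions", "confidence": 0.7}

conn = psycopg2.connect(DSN)
try:
    with conn.cursor() as cur:
        cur.execute(
            "INSERT INTO agent_memory (agent_id, kind, payload) VALUES (%s, %s, %s)",
            ("agent-42", "belief", json.dumps(belief)),
        )
        cur.execute(
            "INSERT INTO agent_steps (agent_id, action) VALUES (%s, %s)",
            ("agent-42", "ran regional revenue probe"),
        )
    conn.commit()      # both writes commit together
except Exception:
    conn.rollback()    # neither write persists on failure
    raise
finally:
    conn.close()
```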
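And for the Python-ecosystem item, a minimal sketch of pulling query results into a pandas DataFrame through a Postgres-dialect SQLAlchemy engine; the URI and `sales` table are placeholders.

```python
# A minimal sketch of reading query results into a pandas DataFrame over a
# Postgres-dialect SQLAlchemy engine. URI and table names are placeholders.
import pandas as pd
from sqlalchemy import create_engine

engine = create_engine(
    "postgresql+psycopg2://agent:<password>@<firebolt-endpoint>:5432/analytics"
)

df = pd.read_sql(
    "SELECT region, SUM(revenue) AS revenue FROM sales GROUP BY region", engine
)
print(df.sort_values("revenue", ascending=False).head())
```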