Blog

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod

Blog

Technical Deep Dive: Automated Column Statistics

Collect statistics about the values in your columns to improve query plans.

Hans-Peter Lehmann

Technical Deep Dive: Automated Column Statistics

Collect statistics about the values in your columns to improve query plans.

Hans-Peter Lehmann

Technical Deep Dive: Efficient and ACID Compliant Vector Search Indexes in Firebolt

This deep dive explores how Firebolt implements native vector search indexing

Demian Hespe

Why 99% of Data Teams Give Up on Real-Time And How Artie Changes That

Robin Tang explains how Artie simplifies real-time data streaming and CDC for teams at ClickUp, Substack, and Alloy.

Firebolt Team

Unlocking Faster Iceberg Queries: The Writer Optimizations You're Missing

Your Apache Iceberg tables are slow because of how your data was written into them.

John Kennedy

Eliminating the OLTP vs OLAP Trade-off

MerchJar transformed query times from "couple of minutes" to sub-second on their Amazon ads optimization platform.

John Kennedy

60 Billion Predictions Daily: Inside Credit Karma’s Agentic Data Layer

Maddie Daianu details Credit Karma's data/AI strategy: GCP, 80B ML predictions, and the Unified Consumer Profile.

Firebolt Team

The $100M Problem: How Lyft's Data Platform Prevents ML Failures with Ritesh Varyani at Lyft

Lyft's Ritesh Varyani details their polyglot data strategy unifying Spark, Trino, and ClickHouse with AI.

Firebolt Team

"Where Do I Put My Logs?" A Conversation with TLDCRM's CEO on Solving the Impossible

A conversation with TLDCRM how they built a customer facing API observability solution at scale.

Sergio Ferragut

Late Materialization: How Firebolt Makes Top-K Queries 30x Faster

Continuing the focus on doing less to maximise performance.

John Kennedy

Right size your engines and achieve unparalleled price-performance with firebolt’s granular scaling

Scale one node at a time to adjust compute resources incrementally, ensuring an ideal price-performance ratio.

Krishna Thotapalli

How we mastered dbt: A true story

At Firebolt, we found out that a duet of dbt and Paradime works for our needs.

Olga Braginskaya

Analyzing the GitHub Events Dataset using Firebolt - Querying with Streamlit

Writing a data app, using Streamlit and Jupyter and the Firebolt Python SDK. A multi-series blog.

Alexander Reelsen

Analyzing the GitHub Events Dataset using Firebolt - Incremental Updates with Apache Airflow

Looking at GithubArchive dataset of public events - leveraging Apache Airflow workflows for keeping our data up-to-date.

Alexander Reelsen

Cloud Data Warehouse Market Share Breakdown: Who Are the Top Players in 2025?

Discover the top cloud data warehouse providers dominating the market in 2025.

Firebolt Team

Cloud Data Warehouse Statistics Trends

Discover key cloud data warehouse statistics and trends.

Firebolt Team

Future of Performance is Not About Performance

The data warehousing market has gone absolutely mad over performance. Why is this the case?

Tino Tereshko

How to accelerate Looker performance on Redshift, Snowflake and BigQuery

How to accelerate Looker performance on Redshift, Snowflake and BigQuery? Short-term fixes and the long-term solutions.

Robert Meyer

AI and Predictive Analytics in Cloud Data Warehousing

Discover how AI and predictive analytics are reshaping cloud data warehousing

Monica Cisneros

Snowflake vs Databricks

More and more, people are asking me “how do you compare Snowflake and Databricks?” We did our best to answer.

Robert Meyer

Snowflake vs. Redshift: Cloud Data Warehouse Comparison

A detailed comparison of Snowflake vs. Redshift, by architecture, scalability, performance, use cases and cost.

Robert Meyer

Pruning even more data with late materialization

Learn how Late Materialization speeds up top-K queries by delaying column scans.

Maximilian Rieger

Firebolt Connector for Confluent : Real-Time Applications, Powered by Streaming Data

Firebolt Connector for Confluent now validated, enabling Real-Time applications, powered by streaming data.

Abhishek Reddy

Block Bad Data Before the Write with Nike’s Ashok Singamaneni

Ashok Singamaneni highlights Spark Expectations, an open-source tool that improves reliability with pre-write DQ checks.

Firebolt Team

FuzzBerg: Hunting Bugs in Iceberg and file-format readers

Firebolt open-sources FuzzBerg to accelerate security testing of Iceberg and other file based readers.

Abhishek Sen

Unlock Real-Time Analytics: Connecting Firebolt to Tableau Cloud

Real-Time Data analytics powering real time dashboards that deliver real insights

Kushagr Nagpal

Implementing Firebolt MERGE Statement

Technical deep dive on the powerful MERGE SQL command, enabling simultaneous operations on a single table.

Tali Magidson

Firebolt ARM Rollout

Performance out of the box, means always ensuring you are running on the most performant hardware

Paul Edgington

Building a Chatbot with Firebolt Using Retrieval-Augmented Generation

We built a Firebolt-powered support chatbot using retrieval-augmented generation (RAG).

Firebolt Team

Implementing Explicit Multi-Statement Transactions in a Stateless, Cloud-Native Architecture

A deep dive into the architectural decisions that enable this feature, with focus on Firebolt's metadata layer.

Gil Cizer

Querying Apache Iceberg with Sub-Second Performance

Firebolt's new READ_ICEBERG capability does a lot of heavy lifting to provide low-latency access to your Iceberg tables.

Lorenz Hübschle

Revolutionizing Data Governance with DataStrato’s Unified Open Source Approach

Uncover the future of data governance and explore innovative solutions for a unified data ecosystem with Lisa Cao, Produ

Firebolt Team

Postgres vs. Elasticsearch: The Unexpected Winner in High-Stakes Search for Instacart

Ankit Mittal details how Instacart's search team migrated their system, shifting from Elasticsearch to Postgres.

Firebolt Team

Caching & Reuse of Subresults across Queries

Deep dive into how Firebolt optimizes query performance through caching and reusing results of parts of the query plan.

Alex Hall

Zach Wilson on what makes a great data engineer

How good you are at Spark or Flink ≠ how good you are at data engineering. Zach Wilson explains.

Firebolt Team

How Similarweb Delivers Customer Facing Analytics Over 100s of TBs

According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is tagging

Firebolt Team

How Eventbrite is Modernizing its Data Stack

An episode about Eventbrite’s data stack modernization process, and how you get engineers to adopt new technologies

Firebolt Team

Transitioning Scopely’s 5.5 PB Data Platform to the Modern Data Stack

Should data engineering AND BI be handled by the same people?

Firebolt Team

Getting Rid of Raw Data with Jens Larsson

Why would you create ugly data? According to Jens Larsson, don’t even go near raw data.

Firebolt Team

How Bolt Engineers Are Designing Its Next-Gen Data Platform

Bolt engineers are in the midst of designing a new next-gen data platform

Firebolt Team

How Zendesk engineers manage customer-facing data applications

Ananth Packkildurai is Principal Software Engineer at Zendesk and runs one of the strongest newsletters in data

Firebolt Team

How are those data intensive customer facing apps engineered at Gong?

Gong manages hundreds of thousands of videoconferences and millions of emails PER DAY, which add up to hundreds of TBs.

Firebolt Team

How Substack's Data Platform Supports 500K Paying Subscribers

How does Substack's data platform support 500K paying subscribers?

Firebolt Team

Is Self-Service BI a False Promise? Lei Tang of Fabi.ai Thinks So

Lei Tang from Fabi.ai discusses AI's role in creating intelligent semantic layers and proactive BI agents.

Firebolt Team

Building Uber's AI Assistant: How Genie Revolutionizes On-Call Support with Paarth Chothani from Uber

Genie, Uber's AI assistant, tackles on-call issues using internal data & Spark. LLMs impact database optimization.

Firebolt Team

Advanced SQL Query Techniques for Data Engineers

Advanced SQL techniques for improving query efficiency and system performance.

Firebolt Team

Automatic Cache Warmup

Discover how Firebolt proactively fetches data to keep query latency low.

Dan Englund

Firebolt Auror

Firebolt built Auror, a blazing-fast admission webhook to cut latency and secure Kubernetes image validation.

Cem Denizsel

Eliminating Redundant Joins in Firebolt for Faster SQL

Learn how Firebolt speeds up SQL by eliminating redundant joins and handling complex correlated subqueries.

Andrés Senac González

Introducing Firebolt Core - Self-Hosted Firebolt, For Free, Forever

Dive into the workings of the forever free, self-hosted edition of Firebolt’s distributed query engine

Mosha Pasumansky

Making Firebolt Fast By Doing Practically Nothing

Learn about the different methods deployed in Firebolt for reducing the number of scanned rows (aka pruning).

Ori Brostovski

Live Engine Upgrades, Zero Downtime: The Firebolt Method

Discover how Firebolt delivers seamless, no-downtime upgrades using shadow clusters and real-time performance.

Ilya Shakhat

Unlock Conversational Data Interaction: Firebolt MCP Server for Advanced LLM Integration

Supercharge your data workflows by connecting Firebolt to AI tools with the new MCP Server.

Ivan Koptiev

AI in Cloud Data Warehousing

AI-driven cloud data warehousing enables faster, more accurate analytics.

Monica Cisneros

Cloud Data Warehouse vs. Data Lake: Which One Should You Choose?

Compare cloud data warehouses and data lakes and learn the key differences and how to make the right choice.

Firebolt Team

Cloud Data Warehouse vs. Traditional Data Warehousing: Why the Shift?

Discover why businesses are migrating from traditional to cloud data warehouses.

Firebolt Team

Cloud Data Warehouse Solutions for Big Data Analytics

Discover the best cloud data warehouse solutions for big data analytics. Learn about features, & benefits.

Firebolt Team

ETL Best Practices for Data Engineers

Discover top ETL best practices for data engineers to optimize workflows, improve performance, and ensure data accuracy.

Firebolt Team

Big Data Challenges & How to Overcome Them

Discover the top big data challenges and learn practical solutions to overcome them.

Firebolt Team

Cloud Data Warehouse Best Practices: Tips for Maximizing Performance

Learn proven techniques to optimize cloud data warehouse performance.

Firebolt Team

AI & Cloud Data Warehouses: 2025-2030 Market Projections

Discover how Firebolt combines AI capabilities with powerful analytics for faster insights and smarter data management.

Firebolt Team

How to Boost Query Performance with Apache Iceberg in Cloud Data Warehouses

Speed up queries with Apache Iceberg and Firebolt's lightning-fast execution.

Firebolt Team

Why Choose Apache Iceberg Over Traditional Table Formats?

Learn how Apache Iceberg outperforms Hive and Delta Lake.

Firebolt Team

How AI is Transforming ETL in Data Warehousing

AI is automating ETL, making data processing faster, more accurate, and cost-effective.

Firebolt Team

Firebolt vs Snowflake: Cloud Data Warehouse Comparison

We often get asked “what’s the difference between Firebolt and Snowflake?” and it reminds me of Frozen.

Robert Meyer

Firebolt vs Teradata: Comparison Guide

Compare Firebolt and Teradata to determine the best cloud data warehouse for your business. Explore pricing, performance

Firebolt Team

Agent to Agent (A2A) | Enable Seamless Communication & Collaboration

Explore A2A architecture, agent based system design, and how Firebolt supports agent communication at scale.

Firebolt Team

How MCP and A2A Are Powering the Next Generation of Intelligent Workflows

Model Context Protocol and Agent-to-Agent architectures are reshaping how organizations approach intelligent systems.

Firebolt Team

Firing Up Firebolt’s Client Ecosystem

Enable users to not use resources on just maintaining a connection when in fact their client is not doing anything

Bogdan Truta

Exploring your data lake in Firebolt using just TVFs

Discover how Firebolt implements SQL functions for data exploration.

Asya Shneerson

[Dynamic Code Blocks] Exploring your data lake in Firebolt using just TVFs

Discover how Firebolt implements SQL functions for data exploration.

Asya Shneerson

From Zero to 100M Users: Inside Notion’s Data Stack and AI Strategy with Sumit Gupta

Master AI data workflows and key soft skills for your evolving data career, with tips from Notion's Lead BI Engineer.

Firebolt Team

Professors Joe Hellerstein and Joseph Gonzalez on LLMs

Joe Hellerstein and Joseph Gonzalez inspired generations of database enthusiasts and are now on the show

Firebolt Team

Unlocking Simplicity and Security: Firebolt’s New LOCATION Object

Discover the new LOCATION object, a foundational improvement to Firebolt’s data access model.

Chen Burshtein

How Rising Wave Is Redefining Real-Time Data with Postgres Power

The Future of Data Processing: PostgreSQL Evolution with YingJun Wu of Rising Wave.

Firebolt Team

GROUPING SETS as a pure planner rewrite ? Yep - it's possible

Learn how GROUPING SETS work and how Firebolt’s implementation uses smart query planning to execute them efficiently.

Julia Spindler

The Future of Data Warehousing in the Age of AI: 5 Key Trends from Firebolt Forward

Legacy BI tools can’t keep up with AI. Firebolt Forward revealed 5 trends defining the future.

Monica Cisneros

Decomposing Firebolt transactions

Explore how Firebolt transaction manager works

Mosha Pasumansky

Recap - Firebolt Forward: Data Warehousing in the Age of AI

Legacy data warehouses can’t handle AI apps. Firebolt is built for subsecond latency, massive concurrency, and more...

Monica Cisneros

Beyond Database Optimization with AI

Discover groundbreaking, innovative approach to database technology as you tune in to this episode with CEO DucksDB Labs

Firebolt Team

The Process of Running FireScale Benchmarks

Learn about the methodology behind constructing and running the FireScale benchmarks.

Cole Bowden

Introducing FireScale - A Benchmark for High Performance and High Concurrency Analytics Workloads

FireScale reveals Firebolt’s 8x-90x price-performance edge & near-linear concurrency scaling for next-gen analytics.

Manish Agarwal

Under the Hood of Firebolt Compute Families

Dive into why Firebolt introduced a new compute family and when you'd want to use it.

Cole Bowden

Introducing Firebolt Editions and Compute Family Choices

Choose the right edition and compute family to optimize query performance and cost-efficiency with Firebolt.

Manish Agarwal

Robust and efficient geospatial operations using snap rounding (Part III)

We will explore in more detail how Firebolt implements robust operations on geospatial data.

Demian Hespe

How Similarweb Serves 100s of TBs to Worldwide Users in Milliseconds

At SimilarWeb, we analyze internet traffic on a global scale, empowering clients with actionable insights.

Firebolt Team

Firebolt February Release Roundup: Versions 4.12 to 4.14 -> Faster queries, RBAC upgrades, and geospatial boosts

Recap of the three most recent releases in Firebolt, including changes from versions 4.12, 4.13, and 4.14.

Cole Bowden

AI and Data Movement: Trends and Best Practices with Estuary’s Daniel Pálma

Transform data engineering, marketing, real-time data integration and the use of AI with Daniel Pálma’s expert insights.

Firebolt Team

Architecture and Internal Representation of the GEOGRAPHY Data Type (Part II)

Discover how Firebolt structures & optimizes GEOGRAPHY data with S2 cells, shape indexes, and pruning for fast queries.

Demian Hespe

Building Geospatial Support in Firebolt (Part-I)

This blog post explores how Firebolt implements geospatial support under the hood.

Demian Hespe

Firebolt Welcomes Former Oracle and Confluent Leader Hemanth Vedagarbha as President, Overseeing Global Go-To-Market Expansion and Customer-Facing Operations

Firebolt names Hemanth Vedagarbha as President to lead global GTM expansion and scale its AI-powered data warehouse.

Firebolt Team

Firebolt’s zero-copy clone

Firebolt’s Zero-Copy Clone Feature: A cost-efficient way to clone massive tables instantly without duplicating data.

Gil Cizer

Firebolt Trial For 30 Days With $200 Free Credits — Now Open to All

Firebolt now welcomes sign-ups with personal email addresses. Explore Firebolt’s low-latency, high-concurrency CDW.

Jonathan Thein

AI and Data Change Management with Chad Sanderson, CEO Gable AI

In this episode of The Data Engineering Show, Chad Sanderson explores the world of data change management.

Firebolt Team

From MySQL Bottlenecks to Firebolt Power: 28X Faster Analytics at Lower Cost

Vrio's CTO explores how Firebolt reduced ecommerce analytics latency from minutes to milliseconds, while reducing TCO.

Ryan McWilliams

Firebolt Features: Effortless Metadata Management For Faster Workflows

Learn how Firebolt streamlines metadata management with zero-copy cloning, dynamic schema evolution for faster workflows

Jonathan Thein

Firebolt December Release Roundup: Versions 4.9 to 4.11 → Geospatial data, zero-copy cloning, and metadata operations

Firebolt delivers zero-copy cloning, a preview for processing geospatial data, and more this month.

Cole Bowden

Tech Stacks and Tradeoffs: Xudo's Founder on Picking the Right Tools for BI Success

Wouter Trappers shares his slightly unconventional path from philosopher to data consultant and engineer.

Firebolt Team

Firebolt DB Release Roundup: Release versions 4.6 to 4.8-> New Functions, Friendlier SQL, and Enhanced Performance

Firebolt DB Release Roundup: Release versions 4.6, 4.7 and 4.8

Tara Shankar Jana

Firebolt is Now Available in Asia Pacific (Singapore)

We're excited to announce that Firebolt is now available in the Asia Pacific (Singapore) region.

Manish Agarwal

5 Reasons to Use Firebolt

Firebolt delivers lightning-fast analytics with SQL simplicity, cost-effective performance, and high query throughput.

Cole Bowden

We use cookies to give you a better online experience

Blog

Technical Deep Dive: Automated Column Statistics

Technical Deep Dive: Automated Column Statistics

Technical Deep Dive: Efficient and ACID Compliant Vector Search Indexes in Firebolt

Why 99% of Data Teams Give Up on Real-Time And How Artie Changes That

Unlocking Faster Iceberg Queries: The Writer Optimizations You're Missing

Eliminating the OLTP vs OLAP Trade-off

60 Billion Predictions Daily: Inside Credit Karma’s Agentic Data Layer

The $100M Problem: How Lyft's Data Platform Prevents ML Failures with Ritesh Varyani at Lyft

"Where Do I Put My Logs?" A Conversation with TLDCRM's CEO on Solving the Impossible

Late Materialization: How Firebolt Makes Top-K Queries 30x Faster

Right size your engines and achieve unparalleled price-performance with firebolt’s granular scaling

How we mastered dbt: A true story

Analyzing the GitHub Events Dataset using Firebolt - Querying with Streamlit

Analyzing the GitHub Events Dataset using Firebolt - Incremental Updates with Apache Airflow

Cloud Data Warehouse Market Share Breakdown: Who Are the Top Players in 2025?

Cloud Data Warehouse Statistics Trends

Future of Performance is Not About Performance

How to accelerate Looker performance on Redshift, Snowflake and BigQuery

AI and Predictive Analytics in Cloud Data Warehousing

Snowflake vs Databricks

Snowflake vs. Redshift: Cloud Data Warehouse Comparison

Pruning even more data with late materialization

Firebolt Connector for Confluent : Real-Time Applications, Powered by Streaming Data

Block Bad Data Before the Write with Nike’s Ashok Singamaneni

FuzzBerg: Hunting Bugs in Iceberg and file-format readers

Unlock Real-Time Analytics: Connecting Firebolt to Tableau Cloud

Implementing Firebolt MERGE Statement

Firebolt ARM Rollout

Building a Chatbot with Firebolt Using Retrieval-Augmented Generation

Implementing Explicit Multi-Statement Transactions in a Stateless, Cloud-Native Architecture

Querying Apache Iceberg with Sub-Second Performance

Revolutionizing Data Governance with DataStrato’s Unified Open Source Approach

Postgres vs. Elasticsearch: The Unexpected Winner in High-Stakes Search for Instacart

Caching & Reuse of Subresults across Queries

Zach Wilson on what makes a great data engineer

How Similarweb Delivers Customer Facing Analytics Over 100s of TBs

How Eventbrite is Modernizing its Data Stack

Transitioning Scopely’s 5.5 PB Data Platform to the Modern Data Stack

Getting Rid of Raw Data with Jens Larsson

How Bolt Engineers Are Designing Its Next-Gen Data Platform

How Zendesk engineers manage customer-facing data applications

How are those data intensive customer facing apps engineered at Gong?

How Substack's Data Platform Supports 500K Paying Subscribers

Is Self-Service BI a False Promise? Lei Tang of Fabi.ai Thinks So

Building Uber's AI Assistant: How Genie Revolutionizes On-Call Support with Paarth Chothani from Uber

Advanced SQL Query Techniques for Data Engineers

Automatic Cache Warmup

Firebolt Auror

Eliminating Redundant Joins in Firebolt for Faster SQL

Introducing Firebolt Core - Self-Hosted Firebolt, For Free, Forever

Making Firebolt Fast By Doing Practically Nothing

Live Engine Upgrades, Zero Downtime: The Firebolt Method

Unlock Conversational Data Interaction: Firebolt MCP Server for Advanced LLM Integration

AI in Cloud Data Warehousing

Cloud Data Warehouse vs. Data Lake: Which One Should You Choose?

Cloud Data Warehouse vs. Traditional Data Warehousing: Why the Shift?

Cloud Data Warehouse Solutions for Big Data Analytics

ETL Best Practices for Data Engineers

Big Data Challenges & How to Overcome Them

Cloud Data Warehouse Best Practices: Tips for Maximizing Performance

AI & Cloud Data Warehouses: 2025-2030 Market Projections

How to Boost Query Performance with Apache Iceberg in Cloud Data Warehouses

Why Choose Apache Iceberg Over Traditional Table Formats?

How AI is Transforming ETL in Data Warehousing

Firebolt vs Snowflake: Cloud Data Warehouse Comparison

Firebolt vs Teradata: Comparison Guide

Agent to Agent (A2A) | Enable Seamless Communication & Collaboration

How MCP and A2A Are Powering the Next Generation of Intelligent Workflows

Firing Up Firebolt’s Client Ecosystem

Exploring your data lake in Firebolt using just TVFs

[Dynamic Code Blocks] Exploring your data lake in Firebolt using just TVFs

From Zero to 100M Users: Inside Notion’s Data Stack and AI Strategy with Sumit Gupta

Professors Joe Hellerstein and Joseph Gonzalez on LLMs

Unlocking Simplicity and Security: Firebolt’s New LOCATION Object

How Rising Wave Is Redefining Real-Time Data with Postgres Power

GROUPING SETS as a pure planner rewrite ? Yep - it's possible

The Future of Data Warehousing in the Age of AI: 5 Key Trends from Firebolt Forward

Decomposing Firebolt transactions