When it comes to data management, have we come a long way since the early 2000s?
With the data ingested, let’s delve right into two popular frameworks for visualizing the data.
This guide will provide you with the fundamental knowledge necessary to handle semi-structured data effectively.
In this blog, we focus on distributed query execution as an integral part of Firebolt.
How good you are at Spark or Flink ≠ how good you are at data engineering. Zach Wilson explains.
dbt data quality - Implementing data quality tests and using dbt extensions for enhanced data quality checks.
How ZipRecruiter and Yotpo build resilient self-service products that keep customers happy and engineers calm
In a recent workshop, 25 data pros working in the Ad Tech industry discussed querying large data sets efficiently
At Firebolt, we found out that a duet of dbt and Paradime works for our needs.
Barr Moses explains how to make sure your data is accurate in a world where so many different teams are accessing it
Writing a small data app using the Firebolt JDBC driver.
Looking at the GithubArchive dataset of public events - leveraging Apache Airflow workflows for keeping our data up-to-date.
In this blog we will discover the data using Streamlit and Jupyter and the Firebolt Python SDK.
Writing a data app, using Streamlit and Jupyter and the Firebolt Python SDK. A multi-series blog.
Event streams have always been problematic to analyze in SQL. This is how we do it.
Amplitude's cutting-edge data stack and how it processes 5 Trillion real-time events while dealing with mutable data
Data apps are applications that rely heavily on data and are easy to use.
AWS re:Invent 2022 was all about building the anticipation and delivering on expectations of us technologists.
80% of the code that you write doesn’t work on the first try. But knowing which 80% is not working is the real challenge
How to ingest, store and query JSON data, for example, is a consistent question on the minds of customers.
Is Postgres truly the right engine for analytics?
Data Mesh is hot stuff. But from a technology perspective it’s still not very well defined.
In our recent ‘Big Data Analytics for Gaming Workshop’ we let the audience do the talking. Here’s a summary of the talk.
Sudeep Kumar, Principal Engineer at Salesforce, considers the shift to ClickHouse as one of his biggest accomplishments
"When I see David Jayatillake and Tristan Handy comment on Firebolt's approach it is clear that Firebolt is on track."
Max walks the Bros through his recipe for a smart data-driven company, and the genesis of Airflow, Superset & Presto.
Firebolt provides an alternative to Druid, delivering fast response times, high concurrency and the convenience of a Saa
In this post, we look at factors to consider when building a data warehouse.
According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is tagging
How to Set Up Your Data Analytics Stack with Kafka, Hevo, and Firebolt.
Are you spending more than you planned on your Data Warehouse? Analyze more. Use less compute resources.
How to enable sub-second analysis across billions of rows of customer behavior data: Part I - Setting up the load
Klarna is one of the leading fintech companies in the world, valued at $45B.
An episode about Eventbrite’s data stack modernization process, and how you get engineers to adopt new technologies
One of the ways Firebolt is able to support data-driven applications is by leveraging aggregating indexes on the tables.
How the data platform evolved as Slack grew from a startup to an IPOed and then acquired company.
Should data engineering AND BI be handled by the same people?
Why would you create ugly data? According to Jens Larsson, don’t even go near raw data.
Ananth Packkildurai is Principal Software Engineer at Zendesk and runs one of the strongest newsletters in data
The data warehousing market has gone absolutely mad over performance. Why is this the case?
Many programming languages are imperative – you tell the compiler how to operate by providing the instructions in order.
Demand from engineering teams has skyrocketed since Firebolt emerged from stealth last year
Gong manages hundreds of thousands of videoconferences and millions of emails PER DAY, which add up to hundreds of TBs.
Bolt engineers are in the midst of designing a new next-gen data platform
Indexes are the primary way for users to accelerate query performance in Firebolt. Learn about them here.
Scaling a data platform to support 1.5T events per day requires complicated technical migrations
Everything you needed to know about cloud data warehouses but were afraid to ask...
Learn when to use Postgres, MySQL, in-memory databases, HTAP, or data warehouses to meet the 1 sec SLA in analytics.
It’s the mother of all development projects. You use it daily. And so do 65M developers around the world.
Learn the top 10 tips for improving your cloud data warehouse performance.
What does a tech stack that always needs to be at the forefront of technology look like?
More and more, people are asking me “how do you compare Snowflake and Databricks?” We did our best to answer.
How does Vimeo handle Data Ops to deal with massive scale?
How does Substack's data platform support 500K paying subscribers?
Steven Moy thoroughly explains Yelp’s data architecture under the hood and how it evolved over the past ten years.
Canva is one of the hottest, if not the hottest, graphic design platforms out there.
AppsFlyer not only deals with 120 billion events per day, but does so while growing quickly as a company
Upstart cloud data warehouse sees rapid growth in 2021, plans to double its workforce
Amazon Athena engine version 2 - what’s new and big enough to call this a 2.0 release?
Making sense of data lakes, delta lake, lakehouse, data warehouse and more.
Working with semi-structured data can be more like a Jason (horror movie) Sequel than JSON SQL.
Explore the significant differences between ELT and ETL data integration processes and find the best option for you.
How to accelerate Looker performance on Redshift, Snowflake and BigQuery? Short-term fixes and the long-term solutions.
When do you need to shift from Redshift, and what are the alternatives? Learn here.
Learn how to upgrade from Tableau extracts to Tableau live connection to deliver sub-second performance every time.
If you’re using Amazon Athena, you may have seen these errors. About AWS Athena errors and how to deal with them.
A detailed comparison of Snowflake vs. Redshift, by architecture, scalability, performance, use cases and cost.
Learn some simple rules of thumb you can use to choose the best federated query engine for your company's needs.
How companies should avoid creating a slow, many-headed federated Gorgon out of Athena.
Why can even simple queries be slow in cloud data warehouses, and how does Firebolt use indexing to prune data and stay fast?
How to support ad hoc analysis - Part 2: The right ad hoc analytics architecture
How to support ad hoc analysis: Part 1 - The 4 requirements for an ad hoc analytics architecture
"In the beginning, there was a data mess". Don’t Panic, just read our data hitchhiker’s guide to cloud analytics.