Size Matters: Best practices for Trillion Row Datasets on ClickHouse

RECORDED: Wednesday, August 10, 2022
SPEAKERS: Robert Hodges & Altinity Support Engineering

ClickHouse is so fast that virtually any developer can get a sub-second response on tables running into billions of rows. It’s different once you reach data sizes in the hundreds of billions or trillions of rows. This webinar walks you through best practices for designing a schema, loading data, and running queries on very large datasets. Expert tricks like combining events in a single fact table, using aggregation to simulate joins, and using materialized views to “index” interesting events in large fact tables are all covered. We’ll even demonstrate the ideas on a trillion-row test data set. Want to scale your data? This webinar is the place to start.

You can download a copy of the webinar slides here.


Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.