ClickHouse Capacity Planning for OLAP Workloads

By Mik Kocikowski, Cloudflare
Presented at the December ClickHouse Meetup, Dec 3, 2019
Concrete findings and “best practices” from building a cluster sized for 150 analytic queries per second on 100TB of HTTP logs. Topics covered: hardware, clients (HTTP vs native), partitioning, indexing, SELECT vs INSERT performance, replication, sharding, quotas, and benchmarking.

Webinar Slides: Adding Fast Analytics to MySQL Applications with ClickHouse

By Robert Hodges and Altinity Engineering Team
Slides for the Webinar, presented on Oct 10, 2019
Webinar recording is available here

Successful MySQL applications have a very common scale problem: how to provide analytics when data grows too large for MySQL to handle. Fortunately, there is a good answer. ClickHouse is a popular, open-source column store you can use to add fast analytics to MySQL applications. We’ll start with an introduction to ClickHouse for MySQL users, showing familiar SQL syntax and table engine design. We’ll then walk through important features like support for the MySQL wire protocol, the ability to read data directly from MySQL tables, and different ways to move data from MySQL into ClickHouse. Finally, we’ll touch on design approaches that allow you to extend MySQL applications with analytics that can handle billions of rows.

Webinar Slides: Secrets of ClickHouse Query Performance

By Robert Hodges and Altinity Engineering Team
Slides for the Webinars, presented on Sept 11 and Sept 17, 2019 
Webinar recording is available here

ClickHouse is famous for speed. That said, you can almost always make it faster! This webinar uses examples to teach you how to deduce what queries are actually doing by reading the system log and system tables. We’ll then explore standard ways to increase query speed: data types and encodings, filtering, join reordering, skip indexes, materialized views, session parameters, to name just a few. In each case we’ll circle back to query plans and system metrics to demonstrate changes in ClickHouse behavior that explain the boost in performance. We hope you’ll enjoy the first step to becoming a ClickHouse performance guru!

Webinar Slides: Introduction to the Mysteries of ClickHouse Replication

By Robert Hodges and Altinity Engineering Team
Slides for the Webinar, presented on July 31, 2019 
Webinar recording is available here

Built-in replication is a powerful ClickHouse feature that helps scale data warehouse performance as well as ensure high availability. This webinar will introduce how replication works internally, explain configuration of clusters with replicas, and show you how to set up and manage ZooKeeper, which is necessary for replication to function. We’ll finish off by showing useful replication tricks, such as utilizing replication to migrate data between hosts. Join us to become an expert in this important subject!

Webinar Slides: ClickHouse and the Magic of Materialized Views

By Robert Hodges and Altinity Engineering Team
Slides for the Webinar, presented on June 26, 2019 
Webinar recording is available here

Materialized views are a killer feature of ClickHouse that can speed up queries 200X or more. Our webinar will teach you how to use this potent tool, starting with how to create materialized views and load data. We’ll then walk through cookbook examples to solve practical problems like deriving aggregates that outlive base data, answering last point queries, and using AggregateFunctions, a special ClickHouse feature, to handle problems like counting unique values. There will be time for Q&A at the end. At that point you’ll be a wizard of ClickHouse materialized views and able to cast spells of your own.