Webinar Slides. Big Data in Real-Time: How ClickHouse powers Admiral’s visitor relationships for publishers

By James Hartig and Robert Hodges
Slides for the Webinar, presented on June 16, 2020
Webinar recording is available here

Advertising is dying in the wake of privacy and adblockers. Join us for a conversation with James Hartig, a Co-Founder at Admiral (getadmiral.com), who helps publishers diversify their revenue and build more meaningful relationships with users. We’ll start with an overview of Admiral’s platform and how they use large-scale session data to power their engagement engine. We’ll then discuss the ClickHouse features that Admiral uses to power these real-time decisions. Finally, we’ll walk through how Admiral migrated from MongoDB to ClickHouse and some of their plans for future projects. Join us to learn how ClickHouse drives cutting-edge real-time applications today!

Webinar Slides. MORE secrets of ClickHouse Query Performance

By Robert Hodges and Alexander Zaitsev
Slides for the Webinar, presented on May 27, 2020
Webinar recording is available here

ClickHouse is famously fast, but a small amount of extra work makes it much faster. Join us for the latest version of our popular talk on single-node ClickHouse performance. We start by examining the system log to see what ClickHouse queries are doing. Then we introduce standard tricks to increase speed: adding CPUs, reducing I/O with filters, restructuring joins, adding indexes, and using materialized views, plus many more. In each case we show how to measure the results of your work. As usual, there will be time for questions at the end. Sign up now to polish your ClickHouse performance skills!

Webinar Slides. Strength in Numbers: Introduction to ClickHouse Cluster Performance

By Robert Hodges and Alexander Zaitsev
Slides for the Webinar, presented on Apr 29, 2020
Webinar recording is available here

ClickHouse clusters apply the power of dozens or even hundreds of nodes to vast datasets. In this webinar we’ll show you how to use the basic tools of replication and sharding to create high-performance ClickHouse clusters. We’ll study the plumbing of inserts into sharded datasets and how to determine the correct number of shards for your desired writes. We’ll similarly look at distributed queries and show how to scale read capacity to desired levels using replicas. Finally, we’ll look at techniques for scaling up both shards and replicas to accommodate growth in your dataset.

Webinar Slides. ClickHouse Monitoring 101: What to monitor and how

By Robert Hodges and Ned McClain
Slides for the Webinar, presented on Apr 1, 2020
Webinar recording is available here

You are about to deploy ClickHouse into production. Congratulations! But what about monitoring? In this webinar we will introduce how to track the health of individual ClickHouse nodes as well as clusters. We’ll describe available monitoring data, how to collect and store measurements, and graphical display using Grafana. We’ll demo techniques and share sample Grafana dashboards that you can use for your own clusters.

Webinar Slides. ClickHouse Materialized Views: The Magic Continues

By Robert Hodges
Slides for the Webinar, presented on Feb 26, 2020
Webinar recording is available here

Materialized views are the killer feature of ClickHouse, and the Altinity 2019 webinar on how they work was very popular. Join this updated webinar to learn how to use materialized views to speed up queries hundreds of times. We’ll cover basic design, last point queries, using TTLs to drop source data, counting unique values, and other useful tricks. Finally, we’ll cover recent improvements that make materialized views more useful than ever.

Webinar Slides. A Practical Introduction to Handling Log Data in ClickHouse

By Robert Hodges and Altinity Engineering Team
Slides for the Webinar, presented on Dec 17, 2019 
Webinar recording is available here

Log messages are one of the most important types of application data. ClickHouse is very good at storing log data; many SaaS applications use it under the covers. In this webinar we will show examples of different application logs and how to design tables to store them. Options include using typed columns, strings, JSON, or key-value pair arrays. We’ll also discuss how to use materialized columns to improve filter speed, as well as techniques to tune index granularity for wide rows. This is the first in a series of webinars on log management. Future webinars will cover tools to load data as well as techniques to retrieve log information efficiently.

Webinar Slides. Tips and tricks every ClickHouse user should know

By Robert Hodges and Altinity Engineering Team
Slides for the Webinar, presented on Dec 3, 2019 
Webinar recording is available here

New users of ClickHouse love the speed but may run into a few surprises when designing applications. Column storage turns classic SQL design precepts on their heads. This talk shares our favorite tricks for building great applications. We’ll talk about fact tables and dimensions, materialized views, codecs, arrays, and skip indexes, to name a few of our favorites. We’ll show examples of each and also reserve time to handle questions. Join us to take your next step to ClickHouse guruhood!

ClickHouse Capacity Planning for OLAP Workloads

By Mik Kocikowski, Cloudflare
Presented at the December ClickHouse Meetup, Dec 3, 2019
Concrete findings and “best practices” from building a cluster sized for 150 analytic queries per second on 100TB of HTTP logs. Topics covered: hardware, clients (HTTP vs native), partitioning, indexing, SELECT vs INSERT performance, replication, sharding, quotas, and benchmarking.