ClickHouse Data Management Internals — Understanding MergeTree Storage, Merges, and Replication

ClickHouse manages huge amounts of data in MergeTree tables. But how is MergeTree organized in storage? What's a merge and how does it work? How does ClickHouse replicate data and commands across clusters? And what's a mutation? Our talk explains these features and more. We'll even discuss mysteries like hard links and the DDL replication queue. Our answers focus on the knowledge necessary to run ClickHouse efficiently and avoid problems. Bring your curiosity and questions!

ClickHouse and Apache Parquet: Past, Present, and Future

Apache Parquet, a popular columnar storage format, has become a linchpin for data-driven organizations seeking efficient and scalable solutions.
ClickHouse can read and write Parquet data at very high speeds which allows interesting integration capability.
In this webinar, we will walk through the existing ClickHouse functionality, give an overview of performance improvements over the last year, and discuss how ClickHouse can do even better.
Presenter: Alexander Zaitsev - Altinity co-founder and CTO. ClickHouse expert since 2016

Eureka! 8 developer tricks for running ClickHouse on Kubernetes

Kubernetes is a great platform for ClickHouse with many success stories. What are the tricks that make it possible? This webinar shows 8 practices to help ClickHouse developers build faster and more cost-efficient analytics on Kubernetes. We'll start with using the Altinity operator, work our way through various techniques to scale up or shut off compute, and end with advice on zero-downtime upgrade. Along the way we'll review key Kubernetes features like node autoscaling. Join us to learn more about data on Kubernetes and impress your friends!

Deep Dive on ClickHouse Sharding and Replication Webinar

ClickHouse works out of the box on a single machine, but it gets complicated in a cluster. ClickHouse provides two scaling-out dimensions -- sharding and replication -- and understanding when and how those should be applied is vital for production and high-load applications. In this webinar we will focus on ClickHouse cluster setup and configuration, cluster operation in public clouds, executing distributed queries, hedged requests, and more.