Deep Dive on ClickHouse Sharding and Replication Webinar

ClickHouse works out of the box on a single machine, but it gets complicated in a cluster. ClickHouse provides two scaling-out dimensions -- sharding and replication -- and understanding when and how those should be applied is vital for production and high-load applications. In this webinar we will focus on ClickHouse cluster setup and configuration, cluster operation in public clouds, executing distributed queries, hedged requests, and more.

ClickHouse Performance Master Class – Tools and Techniques to Speed up any ClickHouse App Webinar

ClickHouse gives impressive performance out of the box. Our webinar will show you how to make it amazing. We start by providing a framework for performance that includes basic drivers like I/O and compute, as well as ClickHouse structures like compression and indexes. We'll also discuss tools to evaluate performance including ClickHouse system tables and EXPLAIN. We'll then demonstrate how to evaluate and improve performance for common query use cases ranging from MergeTree data on block storage to Parquet files in data lakes. Join our webinar to become a master at diagnosing query bottlenecks and curing them quickly.

DATA SUMMIT 2024

Database Trends & Applications is excited to bring our community of data professionals together this May in Boston for 3 days of practical advice, inspiring thought leadership, and in-depth training. Join your peers to learn, share, and celebrate the trends and technologies shaping the future of data. And, see where the world of big data and data science is going and how to get there first.
Our CEO, Robert Hodges is one of the speakers at this conference and his talk will reveal all on Elevate Your Data: Real-time Analytics on Kubernetes via ClickHouse (just changed title, but the same as Building Real-time Analytics on Kubernetes with the ClickHouse Operator).

Petabyte-Scale Data in Real-Time: ClickHouse, S3 Object Storage, and Data Lakes

These days new ClickHouse applications start with petabyte-sized datasets and scale up from there. Fortunately, ClickHouse gives you open-source tools for real-time analytics on big data: MergeTree backed by object storage as well as reading on data lakes. We'll start by showing you popular design patterns for ingest, aggregation, and queries on source data. We'll then dig into specific best practices for defining S3 storage policies, reading from Parquet data, backing up, monitoring, and setting up high-performance clusters in the cloud. It's all open source and works in any cloud. Join us!