
- This event has passed.
Petabyte-Scale Data in Real-Time: ClickHouse, S3 Object Storage, and Data Lakes
WEBINAR
Petabyte-Scale Data in Real-Time: ClickHouse, S3 Object Storage, and Data Lakes
22nd May 2024 @ 8:00 am – 9:00 am PDT

These days new ClickHouse applications start with petabyte-sized datasets and scale up from there. Fortunately, ClickHouse gives you open-source tools for real-time analytics on big data: MergeTree backed by object storage as well as reading on data lakes. We’ll start by showing you popular design patterns for ingest, aggregation, and queries on source data. We’ll then dig into specific best practices for defining S3 storage policies, reading from Parquet data, backing up, monitoring, and setting up high-performance clusters in the cloud. It’s all open source and works in any cloud.
About the Presenters

Alexander Zaitsev – CTO at Altinity
Alexander is a co-founder of Altinity. He has 20 years of engineering and engineering management experience in several international companies. Alexander is an expert in high scale analytics systems design and implementation. He designed and deployed petabyte-scale data warehouses, including one of the earliest ClickHouse deployments outside of Yandex.

Robert Hodges – CEO at Altinity
Robert has more than 30 years of experience with database systems and applications including pre-relational databases such as M204, online SQL transaction processing, Hadoop, and analytics. In the last few years, his work has focused on analytical databases, Kubernetes, and open source.
Want more great content? Subscribe to our newsletter and get all ClickHouse content straight to your inbox.