
- This event has passed.
What’s a Data Lake and What Does It Mean For My Open Source ClickHouse® Stack?
WEBINAR
January 22 @ 8:00 am – 9:00 am Pacific Time (US and Canada)

Data lakes built on open table formats are emerging as the go-to storage for large datasets in analytics, data science, and AI. This talk explains how data lakes work and how ClickHouse® is integrating with them. We’ll introduce the key components of a data lake using a concrete example based on Parquet, the Iceberg open table format, and the Iceberg REST catalog. Next, we’ll look at new ClickHouse® features adapted for data lakes, exploring specific issues like event stream ingest, compaction, and queries. Finally, we’ll illustrate how to combine ClickHouse® with Apache Spark and Kafka to deliver fast analytics on massive, shared tables. The real-time data lake is arriving, and ClickHouse® is going to be a big part of it. Join us and bring your questions!
The webinar has ended, but because we love you, here is the on-demand video:
About the Presenter

Robert Hodges – CEO at Altinity
Robert has more than 30 years of experience with database systems and applications including pre-relational databases such as M204, online SQL transaction processing, Hadoop, and analytics. In the last few years, his work has focused on analytical databases, Kubernetes, and open source.