Loading Events

« All Events

Data Con LA 2025: What’s a Data Lake and What Does It Mean For My Open Source Stack?

CONFERENCE TALK

Data Con LA 2025: What’s a Data Lake and What Does It Mean For My Open Source Stack?

November 8 All day

Our CEO, Robert Hodges, is presenting the talk What’s a Data Lake and What Does It Mean For My Open Source Stack? at the Data CON LA 2025 Conference. The 2025 conference will be held on November 8 at California State University, Long Beach, USA.

Data lakes on open table formats are emerging as the go-to storage of large datasets for analytics, data science, and AI. This talk explains how data lakes work and how ClickHouse® is integrating them.

We’ll introduce the key components of data lakes using a concrete example based on Parquet, Iceberg open table format, and the Iceberg REST catalog. Next, we’ll look at new ClickHouse® feature adaptations, exploring specific issues like event stream ingest, compaction, and queries.

Finally, we’ll illustrate how to combine ClickHouse® with Apache Spark and Kafka to deliver fast analytics on massive, shared tables. The real-time data lake is arriving and ClickHouse® is going to be a big part of it.

About the Presenter

Robert Hodges – CEO at Altinity

Robert Hodges serves as CEO at Altinity, a leading software and services provider for ClickHouse. Robert has more than 30 years of experience with database systems and applications, including pre-relational databases such as M204, online SQL transaction processing, Hadoop, and analytics. In the last few years, his work has focused on analytical databases, Kubernetes, and open source.

Want more great content? Subscribe to our newsletter and get all ClickHouse content straight to your inbox.