OSA Con 2021 Content is Online. See You in 2022!

OSA Con 2021, our first conference dedicated to open source analytics, is a wrap. We had a great conference with 22 talks by 31 presenters. I’m delighted that we could bring together so many different projects and applications under a single roof.

Analyzing Billion Row Datasets with ClickHouse

Presented at Percona Live Austin in May 2019 and as a webinar on June 5, 2019

By Alexander Zaitsev, Altinity CTO, and Robert Hodges, Altinity CEO

Columnar stores like ClickHouse enable users to pull insights from big data in seconds, but only if you set things up correctly. This talk will walk through how to implement a data warehouse that contains 1.3 billion rows using the famous NY Yellow Cab ride data. We’ll start with basic data implementation, including clustering and table definitions, then show how to load data efficiently. Next, we’ll discuss important features like dictionaries and materialized views, and how they improve query efficiency. We’ll end by demonstrating typical queries to illustrate the kind of inferences you can draw rapidly from a well-designed data warehouse. It should be enough to get you started. The next billion rows are up to you!