Analyzing Billion Row Datasets with ClickHouse

Presented at Percona Live Austin, May 2019 and at the Webinar, June 5, 2019

By Alexander Zaitsev, Altinity CTO and Robert Hodges, Altinity CEO

Columnar stores like ClickHouse enable users to pull insights from big data in seconds, but only if you set things up correctly. This talk will walk through how to implement a data warehouse that contains 1.3 billion rows using the famous NY Yellow Cab ride data. We’ll start with basic data implementation including clustering and table definitions, then show how to load efficiently. Next, we’ll discuss important features like dictionaries and materialized views, and how they improve query efficiency. We’ll end by demonstrating typical queries to illustrate the kind of inferences you can draw rapidly from a well-designed data warehouse. It should be enough to get you started–the next billion rows is up to you!

Webinar Slides: ClickHouse on Kubernetes!

Slides for the Webinar, presented on April 16, 2019 
For the webinar video visit https://altinity.com/webinarspage/2019/3/7/a-fast-intro-to-fast-query-with-clickhouse

Data services are the latest wave of applications to catch the Kubernetes bug. Altinity is pleased to introduce the ClickHouse operator, which makes it easy to run scalable data warehouses on your favorite Kubernetes distro. This webinar shows how to install the operator and bring up a new data warehouse in three simple steps. We also cover storage management, monitoring, making config changes, and other topics that will help you operate your data warehouse successfully on Kubernetes. There is time for demos and Q&A, so bring your questions. See you online!

Webinar Slides: A Fast Intro to Fast Query with ClickHouse

Slides for the Webinar, presented on March 6, 2019 
For the webinar video click here

Extracting business insight from massive pools of machine-generated data is the central analytic problem of the digital era. ClickHouse data warehouse addresses it with sub-second SQL query response on petabyte-scale data sets. In this talk we’ll discuss the features that make ClickHouse increasingly popular, show you how to install it, and teach you enough about how ClickHouse works so you can try it out on real problems of your own. We’ll have cool demos (of course) and gladly answer your questions at the end.