Analyzing Billion Row Datasets with ClickHouse

By Altinity Team on June 7th, 2019

Analyzing Billion Row Datasets with ClickHouse

Altinity Team ClickHouseconferencewebinar

 

Presented at Percona Live Austin, May 2019
and at the Webinar, June 5, 2019

By Alexander Zaitsev, Altinity CTO
Robert Hodges, Altinity CEO

For the webinar video visit https://www.altinity.com/webinarspage/2019/6/05/analyzing-billion-row-datasets-with-clickhouse

Columnar stores like ClickHouse enable users to pull insights from big data in seconds, but only if you set things up correctly. This talk will walk through how to implement a data warehouse that contains 1.3 billion rows using the famous NY Yellow Cab ride data. We’ll start with basic data implementation including clustering and table definitions, then show how to load efficiently. Next, we’ll discuss important features like dictionaries and materialized views, and how they improve query efficiency. We’ll end by demonstrating typical queries to illustrate the kind of inferences you can draw rapidly from a well-designed data warehouse. It should be enough to get you started–the next billion rows is up to you!

Leave a Reply

More Presentations

Webinar Slides. ClickHouse Unleashed 2020: Our Favorite New Features for Your Analytical Applications

Curious what’s new in ClickHouse? This talk is for you! Watch to learn about new features of ClickHouse, ranging from tiered storage to compact MergeTree format to SQL join improvements. Preview new features like object storage and LDAP support planned for the remainder of 2020. We close out, by discussing Altinity Stable Releases and how they help you decide when features are ready for production use. Don’t miss this opportunity to get current on ClickHouse improvements to help you build even better analytics!

Webinar Slides. Adventures in Observability: How In-House ClickHouse Deployment Enabled Instana to Build Better Monitoring for Users

Instana is an Application Performance Monitoring solution that provides complete visibility into complex distributed systems, including ClickHouse servers. In this webinar, we welcome Marcel Birkner and Yoann Buch of Instana. They describe how the Instana team uses ClickHouse to power their solution, lessons they have learned, and how they integrated them back into Instana's own ClickHouse monitoring features. Along the way, you'll learn how to set up Instana on ClickHouse, what important metrics you should track, how ClickHouse is integrated in end-to-end distributed tracing, and what level of automation is available for root cause analysis and alerting.

Webinar Slides. Big Data and Beautiful Video: How ClickHouse enables Mux to Deliver Content at Scale

By Adam Brown and Robert Hodges Slides for the Webinar, presented on June 23, 2020 Webinar recording is available here Mux.com enables content providers to stream video to vast audiences while maintaining pin-point control over performance. Join us as Adam Brown, a co-founder of Mux and video expert, explains the role that ClickHouse plays in content delivery. We'll start with an overview of the Mux platform and the importance of real-time feedback. We'll then discuss key features of ClickHouse that enable it to ingest live session data and provide real-time dashboards to Mux users. Finally, we'll talk a bit about the Mux journey to ClickHouse and lessons learned along the way about how data enables content delivery networks.