Supercharge your Analytics with ClickHouse, v.2

It’s an updated version of the presentation with additional slides.

Author: Vadim Tkachenko, CTO at Percona and Principal Advisor at Altinity. 

Overview: ClickHouse is a real-time analytical database system. Even though they’re only celebrating one year as open source software, it has already proved itself ready for the serious workloads.

Supercharge your analytics with ClickHouse

Author: Vadim Tkachenko, CTO at Percona and Principal Advisor at Altinity. Vadim leads Percona Engineering Team, which focuses on development of Percona software products, technology research and performance evaluations of Percona’s and third-party products.

Overview: ClickHouse is a real-time analytical database system. Even though they’re only celebrating one year as open source software, it has already proved itself ready for the serious workloads.

We talk about ClickHouse in general, some internals and why it is so fast. ClickHouse works in conjunction with MySQL — traditionally weak for analytical workloads — and this presentation demonstrates how to make the two systems work together.

How to build analytics for 100bn logs a month with ClickHouse.

Author: Vadim Tkachenko, CTO at Percona and Principal Advisor at Altinity. Vadim leads Percona Engineering Team, which focuses on development of Percona software products, technology research and performance evaluations of Percona’s and third-party products.

Overview: ClickHouse is a real-time analytical database system. With a celebration of only 1 year as OpenSource, it already proved ready for serious workloads. We talked about ClickHouse in general, some internals and why it is so fast. ClickHouse is complementary to MySQL, which is traditionally weak for analytical workload, we touched how to make two systems work together. 

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Oct 5, 2017   
In this blog, I’ll use ClickHouse and Tabix to look at a new very large dataset for research.

It is an interesting dataset I found recently that has been available since 2015. This is Reddit’s comments and submissions dataset, made possible thanks to Reddit’s generous API. 

Massive Parallel Log Processing with ClickHouse

Massive Parallel Log Processing with ClickHouse

In this blog, I’ll look at how to use ClickHouse for parallel log processing. Below I’ll show how ClickHouse can be used to efficiently perform this task. ClickHouse is attractive because it has multi-core parallel query processing

Nested Data Structures in ClickHouse

In this blog post, we’ll look at nested data structures in ClickHouse for MySQL and how this can be used with PMM to look at queries.