ClickHouse release 1.1.54292

  The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: Added the pointInPolygon function for working with coordinates on a coordinate plane. Added the sumMap aggregate function for calculating the sum of arrays, similar to SummingMergeTree. Added the trunc function. Improved performance of the rounding functions (round, floor,[…]

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Oct 5, 2017   
In this blog, I’ll use ClickHouse and Tabix to look at a new very large dataset for research.

It is an interesting dataset I found recently that has been available since 2015. This is Reddit’s comments and submissions dataset, made possible thanks to Reddit’s generous API. 

ClickHouse release 1.1.54289

  The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: SYSTEM queries for server administration: SYSTEM RELOAD DICTIONARY, SYSTEM RELOAD DICTIONARIES, SYSTEM DROP DNS CACHE, SYSTEM SHUTDOWN, SYSTEM KILL. Added functions for working with arrays: concat, arraySlice, arrayPushBack, arrayPushFront, arrayPopBack, arrayPopFront. Added the root and identity parameters for[…]

Webinar: Supercharge Your Analytics with ClickHouse

Webinar: Supercharge Your Analytics with ClickHouse

Speakers: Vadim Tkachenko, Percona’s, CTO and Alexander Zaitsev, Altinity’s Co-Founder.

ClickHouse is a real-time analytical database system. Even though they’re only celebrating one year as open source software, it has already proved itself ready for the serious workloads.

Massive Parallel Log Processing with ClickHouse

Massive Parallel Log Processing with ClickHouse

In this blog, I’ll look at how to use ClickHouse for parallel log processing. Below I’ll show how ClickHouse can be used to efficiently perform this task. ClickHouse is attractive because it has multi-core parallel query processing

ClickHouse New Releases and ChangeLog

It took a while for Yandex to respond to numerous community requests regarding release notes. But finally it’s there. On August 21-st Yandex created a changelog file covering releases starting from 1.1.54245, and they keep it up to date so far.

Nested Data Structures in ClickHouse

In this blog post, we’ll look at nested data structures in ClickHouse for MySQL and how this can be used with PMM to look at queries.

Who and Why is Using ClickHouse

In this article we discover several companies that are using ClickHouse in production for different use cases. 

ClickHouse 1.1.54245 release

  Few weeks ago Yandex released ClickHouse 1.1.54245 that included a number of features requested by community. The new deb packages are available from Yandex repo and RPMs from Altinity repo. So let’s see what’s new in this release. New features Distributed DDLs (see CREATE TABLE ON CLUSTER ) New table engine Dictionary that allows[…]

ClickHouse vs Amazon RedShift Benchmark #2: STAR2002 dataset

Jul 3, 2017   
We continue to benchmark ClickHouse against other analytic DBMSs. We were inspired by the benchmark with star2002 experiment dataset described here, and decided to replicate it using ClickHouse. That gives another interesting comparison vs Amazon RedShift.