ClickHouse® and R language

ClickHouse® and R language

Recently Institute for Interactive Marketing and Social Media research team released ClickHouse library for R language. R is a popular language and environment for statistical analysis and visualization. In this article, we try this library by ourselves by a simple example.

ClickHouse® Primary Keys

ClickHouse® Primary Keys

This is a cross-post from: (https://medium.com/@f1yegor/clickhouse-primary-keys-2cf2a45d7324) Special thanks to Alexey Milovidov, ClickHouse developer, for providing material for this article. Recently I dived deep into ClickHouse. ClickHouse is column-store database by Yandex with great performance for analytical queries. For example check…

Compression in ClickHouse®

Compression in ClickHouse®

It might not be obvious from the start, but ClickHouse supports different kinds of compressions, namely two LZ4 and ZSTD.

Custom partitioning in ClickHouse® 1.1.54310

Custom partitioning in ClickHouse® 1.1.54310

ClickHouse introduced a new major feature in version 1.1.54310 – it is a custom partitioning for tables with MergeTree engine. Before it enforced to use Date field for partitioning and the only choice to partition by month.The new version allows…

ClickHouse® Digest #1

ClickHouse® Digest #1

Activity in and around ClickHouse has dramatically increased recent weeks. Percona Live in Dublin, meetups in Bay Area and Berlin, new features, many new companies adopting ClickHouse — all that generate a lot of news and increase interest to ClickHouse…

ClickHouse® release 1.1.54310

ClickHouse® release 1.1.54310

The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: Custom partitioning key for the MergeTree family of table engines. Kafka table engine. Added support for loading CatBoost models and applying them to data…

Migration to ClickHouse®

Migration to ClickHouse®

Oct 23, 2017   ClickHouse is an excellent analytics database choice not just for startups but also for companies that have already invested significant amount of resources into their analytics solutions, but are not completely satisfied with the results. In this…

ClickHouse® release 1.1.54292

ClickHouse® release 1.1.54292

The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: Added the pointInPolygon function for working with coordinates on a coordinate plane. Added the sumMap aggregate function for calculating the sum of arrays, similar…

Big Dataset: All Reddit Comments – Analyzing with ClickHouse®

Big Dataset: All Reddit Comments – Analyzing with ClickHouse®

Oct 5, 2017   In this blog, I’ll use ClickHouse and Tabix to look at a new very large dataset for research.It is an interesting dataset I found recently that has been available since 2015. This is Reddit’s comments and submissions…

ClickHouse® release 1.1.54289

ClickHouse® release 1.1.54289

The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: SYSTEM queries for server administration: SYSTEM RELOAD DICTIONARY, SYSTEM RELOAD DICTIONARIES, SYSTEM DROP DNS CACHE, SYSTEM SHUTDOWN, SYSTEM KILL. Added functions for working with…

Webinar: Supercharge Your Analytics with ClickHouse®

Webinar: Supercharge Your Analytics with ClickHouse®

Speakers: Vadim Tkachenko, Percona’s, CTO and Alexander Zaitsev, Altinity’s Co-Founder.ClickHouse is a real-time analytical database system. Even though they’re only celebrating one year as open source software, it has already proved itself ready for the serious workloads.

Massive Parallel Log Processing with ClickHouse®

Massive Parallel Log Processing with ClickHouse®

In this blog, I’ll look at how to use ClickHouse for parallel log processing. Below I’ll show how ClickHouse can be used to efficiently perform this task. ClickHouse is attractive because it has multi-core parallel query processing