Big Data Analysis in Digital Marketing Research

Dec 6, 2017Christian Hotz-Behofsits, Teaching & Research Associate at Vienna University of Business and Economics, is one of the creators of RClickhouse package for R that we have recently introduced on our blog. In this article he describes data analysis…

MySQL to ClickHouse data migration and replication

Altinity has developed an open source clickhouse-mysql tool that helps to migrate existing MySQL table to ClickHouse and setup realtime replication of changes (INSERTS only). Here we review necessary steps using a simple example.

ClickHouse and R language

ClickHouse and R language

Recently Institute for Interactive Marketing and Social Media research team released ClickHouse library for R language. R is a popular language and environment for statistical analysis and visualization. In this article, we try this library by ourselves by a simple example.

ClickHouse Primary Keys

This is a cross-post from: (https://medium.com/@f1yegor/clickhouse-primary-keys-2cf2a45d7324) Special thanks to Alexey Milovidov, ClickHouse developer, for providing material for this article. Recently I dived deep into ClickHouse. ClickHouse is column-store database by Yandex with great performance for analytical queries. For example check…

Compression in ClickHouse

It might not be obvious from the start, but ClickHouse supports different kinds of compressions, namely two LZ4 and ZSTD.

Custom partitioning in ClickHouse 1.1.54310

ClickHouse introduced a new major feature in version 1.1.54310 – it is a custom partitioning for tables with MergeTree engine. Before it enforced to use Date field for partitioning and the only choice to partition by month.The new version allows…

ClickHouse Digest #1

Activity in and around ClickHouse has dramatically increased recent weeks. Percona Live in Dublin, meetups in Bay Area and Berlin, new features, many new companies adopting ClickHouse — all that generate a lot of news and increase interest to ClickHouse…

ClickHouse release 1.1.54310

  The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: Custom partitioning key for the MergeTree family of table engines. Kafka table engine. Added support for loading CatBoost models and applying them to…

Migration to ClickHouse

Oct 23, 2017   ClickHouse is an excellent analytics database choice not just for startups but also for companies that have already invested significant amount of resources into their analytics solutions, but are not completely satisfied with the results. In this…

ClickHouse release 1.1.54292

  The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: Added the pointInPolygon function for working with coordinates on a coordinate plane. Added the sumMap aggregate function for calculating the sum of arrays,…

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Oct 5, 2017   In this blog, I’ll use ClickHouse and Tabix to look at a new very large dataset for research.It is an interesting dataset I found recently that has been available since 2015. This is Reddit’s comments and submissions…

ClickHouse release 1.1.54289

  The new deb packages are available from Yandex repo and RPMs from Altinity repo. New features: SYSTEM queries for server administration: SYSTEM RELOAD DICTIONARY, SYSTEM RELOAD DICTIONARIES, SYSTEM DROP DNS CACHE, SYSTEM SHUTDOWN, SYSTEM KILL. Added functions for working…