ClickHouse Data Warehouse
Open Source Meets SQL Column Stores
ClickHouse is the first open source SQL data warehouse to match the performance, maturity, and scalability of proprietary databases like Sybase IQ, Vertica, and Snowflake. State-of-the-art features include:
- Column storage that handles tables with trillions of rows and thousands of columns
- Compression and codecs to reduce I/O dramatically
- Linear scaling through vectorized query and sharding
- Fault-tolerance and read scaling thanks to built-in replication
- Fast ingestion with data instantly queryable after INSERT
- Outstanding aggregation through materialized views
- Features to solve real-world problems like funnel analytics and last point queries
ClickHouse development is driven by a community consisting of hundreds of contributors focused on solving real problems, not implementing corporate roadmaps.
Low Latency Analytics, Everywhere
ClickHouse excels at business problems that require consistent low-latency response across petabyte tables. It can handle data arrival rates of millions of rows per second. Achievable response times include:
- Ad-Hoc queries on source data: 1 second or less
- Queries on aggregates: 10 milliseconds or less
- Ingestion to query response: 500 milliseconds
ClickHouse dominates in use cases like web analytics, real-time network management, service log analysis, real-time ad bidding, asset valuation in financial markets, and security threat identification.
Thanks to Apache 2.0 licenses for ClickHouse and ecosystem projects, developers can build innovative analytics without spending a cent for proprietary licenses or expensive cloud services. However, there’s a lot more to make a developer happy:
- Builds that install in 60 seconds or less
- Run ClickHouse and your entire analytic stack in Kubernetes
- Simple integration with Kafka for rapid data ingestion
- A wealth of features designed to solve problems like funnel analysis, last point queries, and pivoting results
- Ability to read directly from S3, MySQL, PostgresQL, Redis, and ODBC data sources
- Tight integration with popular visualization tools like Grafana, Superset, and Tableau.
Unrivaled Cost-Efficiency and Flexibility
ClickHouse runs anywhere that Linux does: public clouds, Kubernetes, and on-prem environments. Resource stingy algorithms keep operating costs low while maximizing performance on available hardware. The permissive Apache 2.0 license makes ClickHouse an ideal choice for applications ranging from multi-tenant cloud services to embedded applications.