Snowflake, BigQuery, or ClickHouse? Pro Tricks to Build Cost-Efficient Analytics for Any Business

Snowflake, BigQuery, or ClickHouse? Pro Tricks to Build Cost-Efficient Analytics for Any Business

Do you ever look at your bill for Snowflake or BigQuery and just sigh? This talk is for you. We’ll explain how pricing works for popular analytic databases and how to get the best deal. Then we’ll look at how to build an alternative using open-source ClickHouse data warehouses. As the pros say, open source may be free but it ain’t cheap! We’ll teach you the tricks to build your own ClickHouse analytic stack that’s less expensive and faster than Snowflake. Join us to become a wizard of cloud cost management.

Using S3 Storage and ClickHouse: Basic and Advanced Wizardry

Using S3 Storage and ClickHouse: Basic and Advanced Wizardry

S3-compatible object storage can reduce cost and improve flexibility in ClickHouse analytics. This talk helps develop the legerdemain to use it well. We’ll start with standard use patterns like tiered storage and archiving to Parquet. Next, we’ll dig into the details of using object storage in ClickHouse tables: storage policies, authentication, caching, and zero-copy replication. Finally, we’ll present expert tips on topics like S3 cost optimization, designing for mutable data, backup, and others. Join our expert presenters and bring your questions!

Build a Low-Cost, High-Performance Analytic Platform with Kubernetes and Open Source

Build a Low-Cost, High-Performance Analytic Platform with Kubernetes and Open Source

Tired of big bills from Snowflake and BigQuery? Want to keep data in-house? Trying to avoid vendor lock-in? Solve these problems and more by building your own cloud-native analytic service. We start with the architecture of close-source, cloud analytic databases like Snowflake. We then introduce an equally capable design for real-time analytics built entirely on robust open source. Next, we stand up an example using Kubernetes for the run-time, ClickHouse as the query engine, and infrastructure-as-code to deploy apps. Ingest, visualization, and system services are all included. The talk ends with cost numbers to prove that you can operate the service at a fraction of the cost of your current cloud database. (Code used in the platform demo is open source and available on GitHub.) Presenter: Robert Hodges