ClickHouse Data Management Internals — Understanding MergeTree Storage, Merges, and Replication

ClickHouse manages huge amounts of data in MergeTree tables. But how is MergeTree organized in storage? What’s a merge and how does it work? How does ClickHouse replicate data and commands across clusters? And what’s a mutation? Our talk explains these features and more. We’ll even discuss mysteries like hard links and the DDL replication queue. Our answers focus on the knowledge necessary to run ClickHouse efficiently and avoid problems. Bring your curiosity and questions!

Safety First! Using clickhouse-backup for ClickHouse Backup and Restore

Backups protect ClickHouse data in cases ranging from accidentally dropping a database to data centers burning down. In this webinar, we’ll compare available backup options for ClickHouse, then zero in on clickhouse-backup, a popular open-source project that Altinity maintains. The presentation will discuss how clickhouse-backup works, standard backup and restore commands, and tips for reliable operation in production systems. We’ll end with Q&A. Join us if you love your ClickHouse data and want to ensure it sticks around!

Data Hurdles Podcast – The Fast and the Furious: Altinity CEO Robert Hodges’ ClickHouse Joyride

Buckle up for a high-octane conversation on tearing up the data highways with ClickHouse. Altinity CEO Robert Hodges takes the wheel to navigate building fast analytics engines that would smoke any legacy database in a street race. Learn how their souped-up columnar database design wrings out blistering acceleration measured in milliseconds. If you crave speed, this adrenaline-filled test drive will leave you breathless. The pedal will be flat to the floor as Hodges pushes ClickHouse to the limits, revealing the secrets of lightning-fast time to insight. Your analytics have never moved this fast: it’s ClickHouse or bust!

Keeping Your Cloud Native Data Safe: A Common-Sense Guide to Kubernetes, ClickHouse, and Security

Kubernetes is now the default environment for ClickHouse-based analytic platforms. But how to keep your data safe? Our talk is a practical walkthrough of security for ClickHouse on Kubernetes. We’ll show you how to divide the security problem into easily understood pieces. Then we’ll walk through practical ways you can protect data using the Altinity Operator for ClickHouse, Kubernetes Secrets, managed Kubernetes, and more. You don’t have to be a security wizard to protect your ClickHouse data. Common sense and a little organization will do. Watch it now and find out how!

Snowflake, BigQuery, or ClickHouse? Pro Tricks to Build Cost-Efficient Analytics for Any Business

Do you ever look at your bill for Snowflake or BigQuery and just sigh? This talk is for you. We’ll explain how pricing works for popular analytic databases and how to get the best deal. Then we’ll look at how to build an alternative using open-source ClickHouse data warehouses. As the pros say, open source may be free but it ain’t cheap! We’ll teach you the tricks to build your own ClickHouse analytic stack that’s less expensive and faster than Snowflake. Join us to become a wizard of cloud cost management.

Using S3 Storage and ClickHouse: Basic and Advanced Wizardry

S3-compatible object storage can reduce cost and improve flexibility in ClickHouse analytics. This talk helps develop the legerdemain to use it well. We’ll start with standard use patterns like tiered storage and archiving to Parquet. Next, we’ll dig into the details of using object storage in ClickHouse tables: storage policies, authentication, caching, and zero-copy replication. Finally, we’ll present expert tips on topics like S3 cost optimization, designing for mutable data, backup, and others. Join our expert presenters and bring your questions!

Build a Low-Cost, High-Performance Analytic Platform with Kubernetes and Open Source

Tired of big bills from Snowflake and BigQuery? Want to keep data in-house? Trying to avoid vendor lock-in? Solve these problems and more by building your own cloud-native analytic service. We start with the architecture of closed-source cloud analytic databases like Snowflake. We then introduce an equally capable design for real-time analytics built entirely on robust open source. Next, we stand up an example using Kubernetes for the run-time, ClickHouse as the query engine, and infrastructure-as-code to deploy apps. Ingest, visualization, and system services are all included. The talk ends with cost numbers to prove that you can operate the service at a fraction of the cost of your current cloud database. (Code used in the platform demo is open source and available on GitHub.) Presenter: Robert Hodges

Using the Schema-Agnostic Design Pattern on ClickHouse for Product Analytics at Sumsub

Making the Journey to FedRAMP: Cisco Umbrella, Altinity, and ClickHouse-based Analytics

The Cisco Umbrella team is working with Altinity to stand up a FedRAMP Medium deployment for government customers using Altinity’s FIPS-compatible builds of ClickHouse. Join us to learn how the teams are working together on this critical project.

What is Cisco Umbrella and how does it use ClickHouse?
What are the challenges of bringing up ClickHouse in a FedRAMP environment?
How are Cisco Umbrella and Altinity working together to deploy FIPS-compatible analytics?
What lessons can we share with other users on the same path?

Fortress ClickHouse: Secure Your Database and Foil Evildoers in 15 Minutes or Less

ClickHouse analytic databases contain sensitive data and need protection from hackers. But where to start? Our webinar shows how to think about ClickHouse security, then walks you through standard methods to secure ClickHouse installations that any developer can apply. For each method, we’ll explain the threat and show how to remediate it, including a demo of the exact steps. We’ll start easy with eliminating plain text passwords, then work up to encryption, network security, certificate management, role-based access control, and file system protections. Implementing security does not need to take long. Bring a stopwatch and see if it’s really just 15 minutes!

Real-time gets real: the shift to fresh analytics data

We used to think batch processing saved us money and that real-time data was too difficult and expensive to build and maintain. In recent years, powerful databases like ClickHouse have made real-time analytics affordable and accessible to everyone. Join Tinybird CEO Jorge Sancha and Altinity CEO Robert Hodges as they discuss the shift to real-time data.

In this live stream, we’ll discuss the following with two CEOs at the forefront of the real-time revolution:

What is real-time data, and why has it historically been so hard?
What recent technological developments have made real-time affordable and accessible for all?
What is ClickHouse, and why do Tinybird and Altinity love it so much?
Tinybird and Altinity’s respective visions for real-time data.

Run ClickHouse like a Cheapskate – 6 Ways to Save Money While Delivering Real-Time Analytics

Everyone knows ClickHouse is fast. How about cheap? This talk reveals the secrets of ClickHouse champion cheapskates to reduce costs and still deliver outstanding performance. The list runs from developing apps on laptops with Docker Compose all the way to cost optimization in the public cloud. We’ll also consider application tuning, TTL clauses, tiered storage, caches, and more. Finally, we’ll preview the roadmap for cost-efficiency improvements, such as better support for object storage. Our talk is for ClickHouse hackers who want to save big money. Join us and bring your cost questions!