Open Source

CodeMash 2025 Talk: DevOps is a Foreign Language (or Why There Are No Junior SREs)
DevOps has a notoriously steep learning curve. Getting started in the field can feel like being dropped in a foreign country without the ability to understand *anything* about the language.
A language is more than just the syntax and semantic rules of the words themselves. It also encompasses the shared culture of the speakers. With the proliferation of programming languages as well as the deeply held cultural beliefs of the community, it's easy to see that learning DevOps is like trying to learn a foreign language.
I will review five foundational hypotheses from the field of Second Language Acquisition and relate these hypotheses back to the world of DevOps. DevOps practitioners, trainers, tool builders, and learners should all come away with useful insights to apply to their practice.

What’s a Data Lake and What Does It Mean For My Open Source ClickHouse® Stack?
Data lakes on open table formats are emerging as the go-to storage for large datasets in analytics, data science, and AI. This talk explains how data lakes work and how ClickHouse® is integrating them. We’ll introduce the key components of data lakes using a concrete example based on Parquet, the Iceberg open table format, and the Iceberg REST catalog. Next we’ll look at new ClickHouse® feature adaptations, exploring specific issues like event stream ingest, compaction, and queries. Finally, we’ll illustrate how to combine ClickHouse® with Apache Spark and Kafka to deliver fast analytics on massive, shared tables. The real-time data lake is arriving and ClickHouse® is going to be a big part of it. Join us and bring your questions!

Open Source Analytics Community & Altinity at FOSDEM 2025
Come chat with Robert Hodges, Joshua Lee, Alsu Gilizova, and Alkin Tezuysal at FOSDEM 2025.
FOSDEM is a free event for software developers to meet, share ideas, and collaborate. Every year, thousands of developers of free and open-source software from all over the world gather at the event in Brussels.
We're excited to share that we'll be at FOSDEM as part of the Open Source Analytics (OSA) Community. Stop by and chat with us—we'd love to connect.

State of Open Con® 25 London: What’s a Data Lake and What Does It Mean For My Open Source Stack?
State of Open Con, the UK's open technology conference covering open source software, open hardware, open data, open standards, and AI openness, takes place on the 4th and 5th of February 2025 at Sancroft, Rose St, Paternoster Sq., St Paul's, London EC4M 7DQ.
Our CEO, Robert Hodges, will be presenting the talk:
What’s a Data Lake and What Does It Mean For My Open Source Stack?
Data lakes on open table formats like Iceberg are a popular way to manage large datasets for analytics, data science, and AI. This talk explains how data lakes work and how to adapt open source analytic stacks to use them.

State of Open Con® 25 London: Data Sovereignty in Observability
As OpenTelemetry rapidly becomes the standard for open-source observability instrumentation, the question of data sovereignty takes center stage. In this panel, Josh Lee moderates a lively discussion with Hazel Weakly, Robert Hodges, and Edith Puclla, leaders in the fields of observability, databases, and open source. Together, they will explore the implications of data sovereignty in observability, balancing control and compliance with innovation. Topics will include the role of open-source tools, the challenges of ensuring data portability, and the future of observability architectures in a sovereignty-conscious world.

Distributed Tracing with ClickHouse® & OpenTelemetry
Have you ever wondered what happens inside a distributed database when you run a query? In this webinar, we’ll explore the inner workings of ClickHouse using OpenTelemetry-compatible distributed tracing.
First, we’ll analyze the internal workings of various ClickHouse clusters while we run a variety of queries.
Then we’ll show how to use OpenTelemetry- and eBPF-based tools to observe queries coming into ClickHouse with context from the client applications, creating comprehensive end-to-end traces.
Finally, we’ll show how ClickHouse itself can be used as a versatile storage layer for trace data, and we’ll connect our data in ClickHouse to trace visualization tools like Jaeger and Grafana.

Build a Great Business on Open Source without Selling Your Soul
A profitable business is one of the best protections for commercial open-source projects and the communities that depend on them.
Our talk draws on the experience of companies that pulled it off to explain how to do it for your own projects. We’ll discuss commercial models that actually work, giving back to the community, and gracefully collecting money for free software. We'll also discuss topics for larger projects like foundations and taking VC funding.
It is possible to balance a strong belief in open-source communities with making payroll every two weeks. We've done it and will share our secrets.

SCaLE X 2025 – Boosting MySQL with Vector Search: Introducing the MyVector Plugin
As the demand for vector databases and Generative AI continues to rise, integrating vector storage and search capabilities into traditional databases has become increasingly important. This session introduces the *MyVector Plugin*, a project that brings native vector storage and similarity search to MySQL. Unlike PostgreSQL, which offers interfaces for adding new data types and index methods, MySQL lacks such extensibility. However, by utilizing MySQL's server component plugin and UDF, the *MyVector Plugin* successfully adds a fully functional vector search feature within the existing MySQL + InnoDB infrastructure, eliminating the need for a separate vector database. The session explains the technical aspects of integrating vector support into MySQL, the challenges posed by its architecture, and real-world use cases that showcase the advantages of combining vector search with MySQL's robust features. Attendees will leave with practical insights on how to add vector search capabilities to their MySQL systems.

FOSSASIA SUMMIT 2025 – Unified Observability: Leveraging ClickHouse as a Comprehensive Telemetry Database
As modern infrastructures become more complex, traditional monitoring tools often struggle to provide a complete understanding of system health. This presentation will explore how ClickHouse, a high-performance columnar database, enables unified observability by consolidating metrics, logs, traces, and eBPF data into a scalable platform.
Attendees will learn how fragmented telemetry systems can hinder root-cause analysis and operational efficiency and how ClickHouse overcomes these challenges by offering seamless integration with tools like OpenTelemetry, Grafana, Prometheus, and more. We will also compare observability solutions, such as Coroot, qryn, and custom stacks with Jaeger and Loki, to demonstrate how ClickHouse simplifies data management, enhances real-time analysis, and supports large-scale telemetry use cases.
This session will provide practical insights into implementing ClickHouse for observability, enabling engineers to focus on system insights while minimizing infrastructure complexity.

ClickHouse® Disaster Recovery: Tips and Tricks to Avoid Trouble in Paradise
ClickHouse usually works like a charm, but what if a real disaster strikes?
This webinar introduces popular ways to protect your ClickHouse cluster and maybe your job, too.
Our talk starts with the main patterns for setting up ClickHouse disaster recovery and shows available techniques like cross-region replication, backups, and parallel clusters. We'll cover a fully worked example of DR and also discuss obvious and non-obvious trade-offs.
Every site is a little different. This webinar will give you the tools to implement a DR strategy that works for you.

SREDAY 2025 LONDON: Scaling Analytics with ClickHouse® in Cloud-Native Environments
Learn strategies to optimize ClickHouse for massive data streams. Explore Kubernetes orchestration, query optimization, and real-world solutions for scalable, cost-efficient, and reliable analytics in modern systems.
ClickHouse is a high-performance columnar database designed for analytical workloads, but running it efficiently in cloud-native environments presents unique challenges. This talk explores the architecture of ClickHouse and provides strategies for scaling it to handle massive data streams in containerized and distributed systems. We’ll cover topics like volume provisioning, efficient data ingestion pipelines, query optimization, and orchestration with Kubernetes. Real-world examples will demonstrate how to tackle common bottlenecks, improve performance, and reduce costs while maintaining reliability in cloud-native setups. Attendees will leave with actionable insights to build robust, scalable storage and analytics solutions with ClickHouse in modern environments.
SREDAY Conference
Site Reliability, DevOps, and Cloud
March 27-28, London, United Kingdom

SREDAY 2025 San Francisco: Fast, Cheap, DIY Observability with Open Source Analytics and Visualization
Commercial observability tools are expensive and complex, but you can build a fast, cost-effective solution yourself! This talk shows how to use ClickHouse, OpenTelemetry, and Grafana to create scalable, DIY monitoring with open source tools.
Robert Hodges serves as CEO at Altinity, an enterprise provider for ClickHouse. Robert has over 30 years of experience with database systems and applications, including pre-relational databases such as M204, online SQL transaction processing, Hadoop, and analytics. In the last few years, his work has focused on analytical databases, Kubernetes, and open source. Robert is the founder of the Open Source Analytics Conference (osacon.io).
SREDAY Conference
Site Reliability, DevOps, and Cloud
April 11, 2025 San Francisco, CA, USA