Podcast: Combining Python And SQL To Build A PyData Warehouse

Podcast: Combining Python And SQL To Build A PyData Warehouse

Sept 5, 2019
Robert Hodges on The Python Podcast with Tobias Macey

The ecosystem of tools and libraries in Python for data manipulation and analytics is truly impressive, and continues to grow. There are, however, gaps in their utility that can be filled by the capabilities of a data warehouse. In this episode Robert Hodges discusses how the PyData suite of tools can be paired with a data warehouse for an analytics pipeline that is more robust than either can provide on their own. This is a great introduction to what differentiates a data warehouse from a relational database and ways that you can think differently about running your analytical workloads for larger volumes of data.

New Altinity Stable ClickHouse 19.11.8 Release Is Out!

New Altinity Stable ClickHouse 19.11.8 Release Is Out!

It has been quite a while since we announced the previous ‘Altinity Stable’ ClickHouse in December 2018. Since then there have been a lot of changes and new features in ClickHouse. The core team has merged almost 1000 pull requests, and 217 contributors completed about 6000 commits. Unfortunately, during those months of active development ClickHouse suffered from stability issues. We have been working hard together with the core ClickHouse team to nail them down. Today we are happy to introduce a new ‘Altinity Stable’ release 19.11.8!

Podcast: My Future Business Interview with Robert Hodges

Podcast: My Future Business Interview with Robert Hodges

July 24, 2019

If you’re a business that manages a lot of commercial data, and you need a solution that reliably stores and analyzes that information, then this call is for you!

Robert Hodges, Altinity CEO with Rick Nuske, My Future Business

Podcast: Scale Your Analytics On The Clickhouse Data Warehouse

Podcast: Scale Your Analytics On The Clickhouse Data Warehouse

July 8, 2019
Robert Hodges and Alexander Zaitsev on Data Engineering Podcast with Tobias Macey

The market for data warehouse platforms is large and varied, with options for every use case. ClickHouse is an open source, column-oriented database engine built for interactive analytics with linear scalability. In this episode Robert Hodges and Alexander Zaitsev explain how it is architected to provide these features, the various unique capabilities that it provides, and how to run it in production. It was interesting to learn about some of the custom data types and performance optimizations that are included.

Analyzing Billion Row Datasets with ClickHouse

Presented at Percona Live Austin, May 2019 and at the Webinar, June 5, 2019

By Alexander Zaitsev, Altinity CTO and Robert Hodges, Altinity CEO

Columnar stores like ClickHouse enable users to pull insights from big data in seconds, but only if you set things up correctly. This talk will walk through how to implement a data warehouse that contains 1.3 billion rows using the famous NY Yellow Cab ride data. We’ll start with basic data implementation including clustering and table definitions, then show how to load efficiently. Next, we’ll discuss important features like dictionaries and materialized views, and how they improve query efficiency. We’ll end by demonstrating typical queries to illustrate the kind of inferences you can draw rapidly from a well-designed data warehouse. It should be enough to get you started–the next billion rows is up to you!