2023 Sessions On-Demand

#

All Sessions

A Tale of Three Real-Time OLAP Databases

Neha Pawar

Session Speaker
StarTree
Founding Engineer, StarTree
Neha Pawar is a Founding Engineer at StarTree (https://www.startree.ai/), which aims to democratize data for all users by providing real-time, user-facing analytics. Prior to this, she was part of LinkedIn's Data Analytics Infrastructure org for 5 years, working on Apache Pinot & ThirdEye. She is passionate about big data technologies and real-time analytics databases.

Neha is an Apache Pinot PMC and Committer. She has made numerous impactful contributions to Apache Pinot, with a focus on real-time streaming integrations and ingestion. She actively fosters the growing Apache Pinot community & loves to evangelize Pinot by making entertaining video tutorials & illustrations.

When not sipping Pinot, you can find Neha jamming with her husband, painting or hiking with her dogs.

Chinmay Soman

Session Speaker
StarTree
Founding Engineer, StarTree
Chinmay Soman is a founding engineer at StarTree, building real-time analytics solutions at scale. Previously he led the streaming platform team at Uber for building a large-scale, self-serve platform around messaging, stream processing and OLAP technologies. Before that, he worked at LinkedIn and IBM, focusing on distributed systems and security. He’s a PMC member of Apache Samza and a committer on Apache Pinot, Voldemort, uReplicator and AthenaX.

Open source OLAP databases are getting increasingly popular in the last few years and have seen mainstream adoption in several big companies such as LinkedIn, Uber, Netflix, Cisco, Bloomberg and so on. Choosing the right OLAP database depends on several factors such as use case, ingestion constraints and methodology, query performance, and production readiness to name a few. In this talk, we present a high-level comparison of 3 such open-source OLAP databases: Apache Pinot, Apache Druid, and Clickhouse with these dimensions in mind. In addition, we will also look at certain salient features in-depth to uncover how things work under the hood across different systems.