Presented By O'Reilly and Cloudera
Make Data Work
March 13–14, 2017: Training
March 14–16, 2017: Tutorials & Conference
San Jose, CA

Apache Kudu: 1.0 and beyond

Todd Lipcon (Cloudera)
11:00am11:40am Wednesday, March 15, 2017
Hadoop platform and applications
Location: LL21 E/F Level: Intermediate
Secondary topics:  Architecture
Average rating: ****.
(4.75, 4 ratings)

Who is this presentation for?

  • Architects and developers

Prerequisite knowledge

  • General familiarity with the Apache Kudu (incubating) project (no experience contributing to Kudu or programming using the Kudu APIs required)

What you'll learn

  • Learn the latest news from Kudu development and what to expect in upcoming releases
  • Understand the experience of real-life users who have deployed Kudu in production


Apache Kudu was first announced as a public beta release at Strata in NYC 2015 and recently reached 1.0. It has now been a public open source project for more than a year. Todd Lipcon offers a very brief refresher on the goals and feature set of the Kudu storage engine, covering the development that has taken place over the last year, including new features such as improved support for time series workloads, performance improvements, Spark integration, and highly available replicated masters. Along the way, Todd explores real-world production deployments and some of the tools that have been built to help operators manage a Kudu cluster. He ends with a view of the road map of the Kudu project for the upcoming year, including plans for security and other new features.

Photo of Todd Lipcon

Todd Lipcon


Todd Lipcon is an engineer at Cloudera, where he primarily contributes to open source distributed systems in the Apache Hadoop ecosystem. Previously, he focused on Apache HBase, HDFS, and MapReduce, where he designed and implemented redundant metadata storage for the NameNode (QuorumJournalManager), ZooKeeper-based automatic failover, and numerous performance, durability, and stability improvements. In 2012, Todd founded the Apache Kudu project and has spent the last three years leading this team.¬†Todd is a committer and PMC member on Apache HBase, Hadoop, Thrift, and Kudu, as well as a member of the Apache Software Foundation. Prior to Cloudera, Todd worked on web infrastructure at several startups and researched novel machine learning methods for collaborative filtering. Todd holds a bachelor’s degree with honors from Brown University.

Comments on this page are now closed.


Steve Bedeker | IT ARCHITECT
03/15/2017 4:52am PDT

I would also like to have access to the slides.

03/15/2017 4:18am PDT

May I ask for the powerpoint for this session? thanks a lot