Presented By O'Reilly and Cloudera
Make Data Work
March 28–29, 2016: Training
March 29–31, 2016: Conference
San Jose, CA

Uber, your Hadoop has arrived: Powering intelligence for Uber’s real-time marketplace

Vinoth Chandar (Apache Hudi)
11:50am–12:30pm Wednesday, 03/30/2016
Hadoop Use Cases

Location: 210 A/E
Average rating: 4.18 (17 ratings)

Prerequisite knowledge

Attendees should have a general understanding of Spark and Hadoop.


Vinoth Chandar explains how Uber revamped its foundational data infrastructure, with Hadoop as the source-of-truth data lake and Spark as the de facto processing engine, and shares lessons from the experience. Vinoth provides an overview of Uber's data ecosystem and details its old and current data architectures, discussing some of the unique challenges that shaped them. Vinoth also shares the roadmap ahead in areas such as all-active data architecture, Spark infrastructure, and interactive SQL, along with a bigger initiative to reduce data latency into Hadoop.

Vinoth Chandar

Apache Hudi

Vinoth Chandar is the co-creator of the Hudi project at Uber and the PMC lead of Apache Hudi (Incubating). Previously, he was a senior staff engineer at Uber, where he led projects across technology areas including data infrastructure, data architecture, and mobile/network performance. Vinoth has a keen interest in unified architectures for data analytics and processing. Before Uber, he was the LinkedIn lead on Voldemort and worked on Oracle Server’s replication engine, HPC, and stream processing.