Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

Geospatial big data analysis at Uber

Zhenxiao Luo (Uber), Wei Yan (Uber)
11:20am–12:00pm Wednesday, September 27, 2017
Data engineering, Data Engineering & Architecture
Location: 1A 23/24
Level: Intermediate
Secondary topics: Geospatial, Logistics, Platform
Average rating: 4.43 (7 ratings)

Who is this presentation for?

  • Software engineers, data scientists, and researchers

What you'll learn

  • How Uber runs geospatial analysis efficiently in its big data systems

Description

Uber’s geospatial data is increasing exponentially as the company grows. As a result, its big data systems must also grow in scalability, reliability, and performance to support business decisions, user recommendations, and experiments for geospatial data. Zhenxiao Luo and Wei Yan explain how Uber runs geospatial analysis efficiently in its big data systems, including Hadoop, Hive, and Presto.

Zhenxiao and Wei start with an overview of Uber’s big data infrastructure before explaining how Uber models geospatial data and outlining its data ingestion pipeline. They then discuss geospatial query performance improvement techniques and experiences, focusing on geospatial data processing in big data systems, including Hadoop and Presto. Zhenxiao and Wei conclude by sharing Uber’s use cases and roadmap.

Zhenxiao Luo

Uber

Zhenxiao Luo is an engineering manager at Uber, where he runs the interactive analytics team. Previously, he led the development and operations of Presto at Netflix and worked on big data and Hadoop-related projects at Facebook, Cloudera, and Vertica. He holds a master’s degree from the University of Wisconsin-Madison and a bachelor’s degree from Fudan University.

Wei Yan

Uber

Wei Yan is a senior engineer at Uber, where he builds data processing and querying systems that scale along with Uber’s hypergrowth.