Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Presto: Distributed SQL done faster

Wojciech Biela (Starburst), Łukasz Osipiuk (Teradata)
17:2518:05 Wednesday, 24 May 2017
Data engineering and architecture
Location: Capital Suite 10/11
Level: Beginner
Average rating: ****.
(4.00, 1 rating)

Who is this presentation for?

  • Everyone can get value from this presentation

Prerequisite knowledge

  • General knowledge of Hadoop and HDFS
  • Basic experience working with databases

What you'll learn

  • Gain an introduction to Presto, an open source distributed analytical SQL engine that enables users to run interactive queries over their datasets stored in various data sources, and learn its use cases

Description

Interactive analysis of data stored in HDFS and other data sources has been gaining traction, and the field has been rapidly growing in the past few years. Wojciech Biela and Łukasz Osipiuk offer an introduction to Presto, an open source distributed analytical SQL engine that enables users to run interactive queries over their datasets stored in various data sources, including HDFS (Hive/Hadoop), Amazon S3, and various SQL and NoSQL data stores.

Presto is developed under the Apache 2.0 license. It was started at Facebook as an initiative to enable interactive querying across a variety of data stores. The project has a large and growing community of users that include Airbnb, LinkedIn, Netflix, Twitter, and Uber. Wojciech and Łukasz explore Presto’s design fundamentals and core capabilities and cover recent functional additions to Presto as well as current and future development themes. Along the way, they also describe the major Presto installations (Facebook, Netflix, Uber) and their usage scenarios.

Photo of Wojciech Biela

Wojciech Biela

Starburst

Wojciech Biela is a cofounder of Starburst, where he’s responsible for product development. He has over 14 years’ experience building products and running engineering teams. Previously, Wojciech was the engineering manager at the Teradata Center for Hadoop, running the Presto engineering operations in Warsaw, Poland; built and ran the Polish engineering team for a subsidiary of Hadapt, a pioneer in the SQL-on-Hadoop space (acquired by Teradata in 2014); and built and led teams on multiyear projects from custom big ecommerce and SCM platforms to POS systems. Wojciech holds an MS in computer science from the Wroclaw University of Technology.

Photo of Łukasz Osipiuk

Łukasz Osipiuk

Teradata

Łukasz Osipiuk is a software engineer at the Teradata Center for Hadoop within Teradata Labs, where he is actively engaged in open source Presto development and architecture design. Łukasz was a core member of SQL-on-Hadoop startup Hadapt before its acquisition by Teradata in 2014. Previously, Łukasz was employed at GG Network, where he worked on its large-scale instant messenger core backend and distributed drive storage backend. He graduated from Warsaw University.