Mar 15–18, 2020

Presto on Kubernetes: Query anything, anywhere

4:15pm–4:55pm Tuesday, March 17, 2020
Location: LL20A
Secondary topics:  Data Management and Storage

Who is this presentation for?

Data engineers, data architects, developers

Level

Intermediate

Description

Presto is an open source distributed SQL engine, widely recognized for its low-latency queries, high concurrency, and native ability to query multiple data sources. Proven at scale in a variety of use cases at Airbnb, Comcast, GrubHub, Facebook, FINRA, LinkedIn, Lyft, Netflix, Twitter, and Uber, Presto has grown rapidly in popularity over the last few years, in both on-premises and cloud deployments, over object stores, the Hadoop Distributed File System (HDFS), NoSQL stores, and relational database management systems (RDBMSs).

Kamil Bajda-Pawlikowski explores deploying and using Presto across hybrid and multicloud environments, including the Red Hat OpenShift Container Platform, Google Kubernetes Engine (GKE), Azure Kubernetes Service (AKS), and Amazon Elastic Container Service for Kubernetes (EKS).

Kubernetes reduces the burden and complexity of configuring, deploying, managing, and monitoring containerized applications. With Kubernetes, you can deploy and manage a Presto cluster with coordinator high availability, worker autoscaling, and monitoring via Prometheus.

Prerequisite knowledge

  • A basic understanding of SQL, cloud, and Hadoop

What you'll learn

  • Gain an overview of Presto
  • Understand virtualized querying across multiple data sources
  • Learn about flexible deployment with Kubernetes on-premises and the cloud

Kamil Bajda-Pawlikowski

Starburst

Kamil Bajda-Pawlikowski is a cofounder and CTO of the enterprise Presto company Starburst. Previously, Kamil was the chief architect at the Teradata Center for Hadoop in Boston, focusing on the open source SQL engine Presto, and the cofounder and chief software architect of Hadapt, the first SQL-on-Hadoop company (acquired by Teradata). Kamil began his journey with Hadoop and modern MPP SQL architectures about 10 years ago during a doctoral program at Yale University, where he co-invented HadoopDB, the original foundation of Hadapt’s technology. He holds an MS in computer science from Wroclaw University of Technology and both an MS and an MPhil in computer science from Yale University.
