Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

Running Presto and Spark on AWS: From zero to insight in less than five minutes

Jonathan Fritz (Amazon Web Services)
2:05pm–2:45pm Thursday, 09/29/2016
Location: 1 E 09
Tags: cloud
Average rating: *****
(5.00, 1 rating)

What you'll learn

  • Explore how organizations are deploying big data frameworks with Amazon Web Services (AWS)
  • Understand how to quickly and securely run Spark and Presto on AWS
  • Description

    Organizations from small startups to large enterprises are increasingly using open source frameworks such as Apache Hadoop, Spark, and Presto to address a broad range of analytic use cases, including business intelligence, stream processing, and machine learning. However, with any big data project comes the risk of uncapped costs, delayed timelines, expensive infrastructure, and difficult choices about where to focus in the open source toolset.
    Jonathan Fritz explains how organizations are deploying these and other big data frameworks with Amazon Web Services (AWS) and how you too can quickly and securely run Spark and Presto on AWS. Jonathan demonstrates how to lower costs and accelerate deployment of big data applications, using Amazon EMR to easily create a Hadoop cluster running Spark and Presto and querying data in Amazon S3 using ANSI SQL. Jonathan then explores how you can use Amazon S3 as a highly scalable, durable, and secure data lake by decoupling compute from storage, before outlining best practices to lower costs using Amazon EC2 Spot Instances and discussing how to secure your clusters using AWS’s extensive security capabilities.

    This session is sponsored by Amazon.

    Photo of Jonathan Fritz

    Jonathan Fritz

    Amazon Web Services

    Jonathan Fritz is a senior product manager at Amazon Elastic MapReduce (EMR), a managed service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data using Hadoop, Spark, and Presto. Previously, Jonathan was the founder and CEO of Eleven Media Group and performed research in organic chemistry and nanotechnology in the Maurer Group at Washington University in St. Louis. He holds an MBA from the Stanford Graduate School of Business and a bachelor’s degree in chemistry with minor in biology from Washington University in St. Louis. He received a certificate for accomplishment in entrepreneurship from the Skandalaris Center for Entrepreneurial Studies.