Presented By O'Reilly and Cloudera
Make Data Work
March 13–14, 2017: Training
March 14–16, 2017: Tutorials & Conference
San Jose, CA

Architecting and building enterprise-class Spark and Hadoop in cloud environments

James Malone (Google), John Mikula (Google Cloud)
1:30pm–5:00pm Tuesday, March 14, 2017
Secondary topics: Architecture, Cloud
Average rating: 2.00 (6 ratings)

What you'll learn

  • Learn how to use managed Spark and Hadoop solutions in public clouds alongside cloud products for storage, analysis, and message queues to meet enterprise requirements via the Spark and Hadoop ecosystem

Description

James Malone explores using managed Spark and Hadoop solutions in public clouds alongside cloud products for storage, analysis, and message queues to meet enterprise requirements via the Spark and Hadoop ecosystem. To illustrate the concepts, James walks you through hands-on exercises using the Google Cloud Platform.

Topics include:

  • How cloud architecture is different—design differences and pros and cons; separation of compute and storage; persistent versus ephemeral; large multitenant design versus separated or distributed designs; and security, auditing, and management differences
  • Why you should use a managed Spark and Hadoop solution in a cloud
  • Using multiple cloud products alongside managed Spark and Hadoop clusters, including object storage (Google Cloud Storage, Amazon S3, Azure Blob Storage); analytics services (Google BigQuery, Amazon Redshift, etc.); messaging systems (Google Pub/Sub, Amazon SQS, etc.); and streaming solutions (Amazon Kinesis, Google Dataflow, Azure Stream Analytics)

James Malone

Google

James Malone is a product manager for Google Cloud Platform and manages Cloud Dataproc and Apache Beam (incubating). Previously, James worked at Disney and Amazon. He’s a big fan of open source software because it shows what’s possible when people come together to solve common problems with technology. He also loves data, amateur radio, Disneyland, photography, running, and Legos.

John Mikula

Google Cloud

John Mikula is a tech lead for Google Cloud, where he manages the team focused on enterprise features for Google Cloud Dataproc.