Presented By O'Reilly and Cloudera
Make Data Work
31 May–1 June 2016: Training
1 June–3 June 2016: Conference
London, UK

Being successful with Apache Hadoop in the cloud

Jennifer Wu (Cloudera)
16:35–17:15 Thursday, 2/06/2016
Enterprise adoption
Location: Capital Suite 15/16 Level: Intermediate
Average rating: ***..
(3.00, 9 ratings)

Prerequisite knowledge

Attendees should have a basic understanding of cloud computing concepts.


Organizations are increasingly considering cloud deployments, whether for their entire operation or just a particular application. Running Apache Hadoop in the cloud changes how you deploy and run jobs. There are a number of factors to consider when running Hadoop workloads in the cloud:

  • How compute and storage can grow independently
  • How nodes can be added or removed from clusters and change based on events
  • How use of the object store and instance selection can affect the economical outcome
  • How clusters can be customized for the application given the array of instance choices

Jennifer Wu outlines concepts for successfully running Hadoop in the cloud, provides guidance on selecting cloud storage, covers real-world examples of Hadoop deployment patterns in public clouds, and demos Cloudera Director provisioning on AWS.

Photo of Jennifer Wu

Jennifer Wu


Jennifer Wu is director of product management for cloud at Cloudera, where she focuses on cloud services and data engineering. Previously, Jennifer worked as a product line manager at VMware, working on the vSphere and Photon system management platforms.