Presented By O'Reilly and Cloudera
Make Data Work
March 28–29, 2016: Training
March 29–31, 2016: Conference
San Jose, CA

Bringing the Apache Hadoop ecosystem to the Google Cloud Platform

Jennifer Wu (Cloudera), James Malone (Google)
11:50am–12:30pm Wednesday, 03/30/2016
Enterprise Adoption

Location: 230 C
Average rating: ***..
(3.75, 4 ratings)

Prerequisite knowledge

Attendees should have a basic knowledge of the Hadoop ecosystem and public cloud use cases.


With Apache Hadoop workloads moving into public cloud environments at an increasing rate, leading ecosystem players have aligned to provide the best possible Hadoop (including Apache Spark) experience for users. Jennifer Wu and James Malone offer an insider look at how Google—whose internal systems at various times inspired the creation of many Hadoop ecosystem projects—has integrated Hadoop components like HDFS, Impala, and Apache Spark with Google Cloud Platform technologies like Google Compute Engine (GCE), Bigtable, BigQuery, and Cloud Storage.

Jennifer and James cover scenarios for taking advantage of cloud elasticity to run entire production enterprise data hubs on Cloud Storage—specifically Hadoop, Spark, and Impala workloads on other Google Cloud Platform services such as Cloud Dataproc and Bigtable. They also explore the importance of Google’s growing collaboration with open source communities for meeting these goals, explaining how that collaboration benefits the entire ecosystem.

Photo of Jennifer Wu

Jennifer Wu


Jennifer Wu is director of product management for cloud at Cloudera, where she focuses on cloud services and data engineering. Previously, Jennifer worked as a product line manager at VMware, working on the vSphere and Photon system management platforms.

Photo of James Malone

James Malone


James Malone is a product manager for Google Cloud Platform and manages Cloud Dataproc and Apache Beam (incubating). Previously, James worked at Disney and Amazon. He’s a big fan of open source software because it shows what’s possible when people come together to solve common problems with technology. He also loves data, amateur radio, Disneyland, photography, running, and Legos.