With Apache Hadoop workloads moving into public cloud environments at an increasing rate, leading ecosystem players have aligned to provide the best possible Hadoop (including Apache Spark) experience for users. Jennifer Wu and James Malone offer an insider look at how Google—whose internal systems at various times inspired the creation of many Hadoop ecosystem projects—has integrated Hadoop components like HDFS, Impala, and Apache Spark with Google Cloud Platform technologies like Google Compute Engine (GCE), Bigtable, BigQuery, and Cloud Storage.
Jennifer and James cover scenarios for taking advantage of cloud elasticity to run entire production enterprise data hubs on Cloud Storage—specifically Hadoop, Spark, and Impala workloads on other Google Cloud Platform services such as Cloud Dataproc and Bigtable. They also explore the importance of Google’s growing collaboration with open source communities for meeting these goals, explaining how that collaboration benefits the entire ecosystem.
Jennifer Wu is director of product management for cloud at Cloudera, where she focuses on cloud services and data engineering. Previously, Jennifer worked as a product line manager at VMware, working on the vSphere and Photon system management platforms.
James Malone is a product manager for Google Cloud Platform and manages Cloud Dataproc and Apache Beam (incubating). Previously, James worked at Disney and Amazon. He’s a big fan of open source software because it shows what’s possible when people come together to solve common problems with technology. He also loves data, amateur radio, Disneyland, photography, running, and Legos.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.