Presented By O’Reilly and Cloudera
Make Data Work
March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

What's new in Hadoop 3.0

Daniel Templeton (Cloudera), Andrew Wang (Cloudera)
11:00am11:40am Wednesday, March 7, 2018
Average rating: ****.
(4.67, 6 ratings)

Who is this presentation for?

  • Data engineers, pipeline engineers, and system administrators

Prerequisite knowledge

  • A working knowledge of Hadoop (useful but not required)

What you'll learn

  • Explore upcoming features in Hadoop 3.0 and learn when and how to apply them

Description

Apache Hadoop has been synonymous with open source big data analytics for over a decade. With the 3.0 major release, Apache Hadoop continues to evolve with the addition of significant new features like HDFS erasure coding, YARN Timeline Service v2, and MapReduce task-level optimization. Together, these new features improve the performance, scalability, and multitenancy capabilities of Hadoop.

Andrew Wang and Daniel Templeton offer an overview of new features and discuss current release management status and community testing efforts dedicated to making Hadoop 3.0 the best Hadoop major release yet.

Photo of Daniel Templeton

Daniel Templeton

Cloudera

Daniel Templeton has a long history in high-performance computing, open source communities, and technology evangelism. Today Daniel works on the YARN development team at Cloudera, focused on the resource manager, fair scheduler, and Docker support.

Photo of Andrew Wang

Andrew Wang

Cloudera

Andrew Wang is a software engineer on the HDFS team at Cloudera. Previously, he was a graduate student in the AMPLab at the University of California, Berkeley, advised by Ion Stoica, where he worked on research related to in-memory caching and quality of service. In his spare time, he enjoys going on bike rides, cooking, and playing guitar.

Comments on this page are now closed.

Comments

Corey Schooler | LEAD SOFTWARE ENGINEER
03/08/2018 6:44am PST

Will you be adding the slides to this page?

Picture of Daniel Templeton
Daniel Templeton | SOFTWARE ENGINEER
02/23/2018 4:43am PST

Diana, there are no features of Hadoop that are specific to Cloudera’s distribution. The Hadoop 3.0 that Cloudera will be shipping is the same Hadoop 3.0 that’s available from Apache. This talk covers the features of the open source Apache Hadoop 3.0.

Picture of Diana Maltsman
Diana Maltsman | ARCHITECT ADVISOR
02/21/2018 11:37pm PST

Will this be specific to the Cloudera distribution of Hadoop or general features of the open source?