Apache Hadoop has been synonymous with open source big data analytics for over a decade. With the 3.0 major release, Apache Hadoop continues to evolve with the addition of significant new features like HDFS erasure coding, YARN Timeline Service v2, and MapReduce task-level optimization. Together, these new features improve the performance, scalability, and multitenancy capabilities of Hadoop.
Andrew Wang and Daniel Templeton offer an overview of new features and discuss current release management status and community testing efforts dedicated to making Hadoop 3.0 the best Hadoop major release yet.
Daniel Templeton has a long history in high-performance computing, open source communities, and technology evangelism. Today Daniel works on the YARN development team at Cloudera, focused on the resource manager, fair scheduler, and Docker support.
Andrew Wang is a software engineer on the HDFS team at Cloudera. Previously, he was a graduate student in the AMPLab at the University of California, Berkeley, advised by Ion Stoica, where he worked on research related to in-memory caching and quality of service. In his spare time, he enjoys going on bike rides, cooking, and playing guitar.
Comments on this page are now closed.
For exhibition and sponsorship opportunities, email strataconf@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata Data Conference contacts
©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com
Comments
Will you be adding the slides to this page?
Diana, there are no features of Hadoop that are specific to Cloudera’s distribution. The Hadoop 3.0 that Cloudera will be shipping is the same Hadoop 3.0 that’s available from Apache. This talk covers the features of the open source Apache Hadoop 3.0.
Will this be specific to the Cloudera distribution of Hadoop or general features of the open source?