Skip to main content

Apache Hive & Stinger: Petabyte Scale SQL, IN Hadoop

Arun Murthy (Cloudera ), Alan Gates (Hortonworks), Owen O'Malley (Cloudera)
Sponsored Gramercy Suite B
Average rating: ****.
(4.75, 4 ratings)
Slides:   1-PPTX 

Apache Hive is the de-facto standard for SQL-in-Hadoop today, with more enterprises relying on this open source project than on any alternative. Enterprises have asked for Hive to become more real-time and interactive‚ and the Hive community has responded.

Please join Arun Murthy, Owen O’Malley and Alan Gates to learn more about Stinger and improvements to Apache Hive, how much Hive has grown in the last 12 months, and how much further it will soon go.

Alan, Arun & Owen will cover how Hive:

  • Increases performance and scale by simplifying tasks using Apache Tez (Arun)
  • Decreases file size and effectiveness with the ORC file (Owen)
  • Expanding OLAP functionality, data type conformance, and adding ACID compliant updates (Alan)

This session is sponsored by Hortonworks

Photo of Arun Murthy

Arun Murthy


Arun is the lead of the MapReduce project in Apache Hadoop where he has been a full-time contributor to Apache Hadoop since its inception in 2006. He is a long-time committer and member of the Apache Hadoop PMC and jointly holds the current world sorting record using Apache Hadoop. Prior to co-founding Hortonworks, Arun was responsible for all MapReduce code and configuration deployed across the 42,000+ servers at Yahoo!. In essence, he was responsible for running Apache Hadoop?s MapReduce as a service for Yahoo!. Twitter: @acmurthy. He is directly responsible for every bit of code and configuration of Map-Reduce deployed at over 40,000 machines running Apache Hadoop.

Photo of Alan Gates

Alan Gates


Alan is a co-founder at Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a book from O’Reilly Press.

Photo of Owen O'Malley

Owen O'Malley


Owen has been contributing to Apache Hadoop since before it was first called Hadoop. He was the first committer added to the project and has provided technical leadership on MapReduce, and security. Using Hadoop in 2008 he set the world record for sorting a terabyte of data in 3.5 minutes and in 2009 he sorted a petabyte in 16.25 hours. In 2011, Own co-founded Hortonworks, which commercially supports and trains users of the Hadoop ecosystem. Prior to Hortonworks, Owen worked on Yahoo! Search’s WebMap project, which built the know web. Once ported to Apache Hadoop, it became the single largest low Hadoop application.

Comments on this page are now closed.


Picture of Alan Gates
Alan Gates
11/01/2013 10:13am EDT

I’ve just posted the slides.

Marek K Kolodziej
10/30/2013 4:31pm EDT

Would it be possible to post the slides here, like the other speakers have?


Sponsorship Opportunities

For exhibition and sponsorship opportunities, contact Susan Stewart at

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences email mediapartners

Press & Media

For media-related inquiries, contact Maureen Jennings at

Contact Us

View a complete list of Strata + Hadoop World 2013 contacts