Introduction to Hive - CANCELED

Aaron Kimball (Cloudera, Inc.)
Location: E141/E142
Please note: to attend, your registration must include Tutorials.

An introduction to Hive and Hadoop

The basics.

Getting data into Hive

Learn how to create tables in Hive, using the appropriate table properties and data types. Then we will cover
loading data into Hive from files on a local file system, files in HDFS or data in an RDBMS.


The Hive Query Language is an SQL-like language for querying data in Hadoop. It supports a subset of SQL-92
features, but also adds Hive/Hadoop-specific enhancements.

Query Execution

This section will explain how Hive converts queries into MapReduce code that is executed on the Hadoop cluster.

Partitioning and Bucketing

A powerful feature of Hive is the ability to partition and bucket your data. Partitioning is a way of organizing large
data sets into distinct subsets. Bucketing is useful for sampling a fraction of data.

Best Practices

These are specific recommendations for configuring Hive, handling data properly, and designing for performance.

Photo of Aaron Kimball

Aaron Kimball

Cloudera, Inc.

Aaron Kimball is a software engineer at Cloudera, Inc., the Commercial Hadoop company. Aaron is the principle developer of Sqoop, the SQL-to-Hadoop database import/export tool. Aaron has been working with Hadoop since early 2007, and contributes actively to its development. Through Cloudera, he additionally provides training to developers and system administrators working with Hadoop. Aaron holds a B.S. in Computer Science from Cornell University, and an M.S. in Computer Science and Engineering from the University of Washington.

  • Intel
  • Microsoft
  • Google
  • Facebook
  • Rackspace Hosting
  • (mt) Media Temple, Inc.
  • ActiveState
  • CommonPlaces
  • DB Relay
  • FireHost
  • GoDaddy
  • HP
  • HTSQL by Prometheus Research
  • Impetus Technologies Inc.
  • Infobright, Inc
  • JasperSoft
  • Kaltura
  • Marvell
  • Mashery
  • NorthScale, Inc.
  • Open Invention Network
  • OpSource
  • Oracle
  • Parallels
  • PayPal
  • Percona
  • Qualcomm Innovation Center, Inc.
  • Rhomobile
  • Schooner Information Technology
  • Silicon Mechanics
  • SourceGear
  • Symbian
  • VoltDB
  • WSO2
  • Linux Pro Magazine

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Sharon Cordesse at

Download the OSCON Sponsor/Exhibitor Prospectus

Media Partner Opportunities

Download the Media & Promotional Partner Brochure (PDF) for information on trade opportunities with O'Reilly conferences or contact mediapartners@

Press and Media

For media-related inquiries, contact Maureen Jennings at

OSCON Newsletter

To stay abreast of conference news and to receive email notification when registration opens, please sign up for the OSCON Newsletter (login required)

OSCON 2.0 Ideas

Have an idea for OSCON to share?

Contact Us

View a complete list of OSCON contacts