Introduction to Hadoop (the what)

Aaron Kimball (Cloudera, Inc.)
Location: E141/E142
Tags: cloud, hadoop
Please note: to attend, your registration must include Tutorials.
Average rating: ***..
(3.29, 17 ratings)

Thinking at Scale: Introduction to Hadoop
You know your data is big – you found Hadoop. What implications must you consider when working at this scale?
This lecture addresses common challenges and general best practices for scaling with your data.

MapReduce and HDFS

These tools provide the core functionality to allow you to store, process, and analyze big data. This lecture “lifts
the curtain” and explains how the technology works. You’ll understand how these components fit together and
build on one another to provide a scalable and powerful system.

The Hadoop Ecosystem

An introduction to the other projects surrounding Hadoop, which round out the broader ecosystem of
large-scale data processing tools.

Augmenting Existing Systems with Hadoop

Hadoop rarely replaces existing infrastructure, but rather enables you to do more with your data by providing a
scalable batch processing system. This lecture helps you understand how it all fits together.


Aaron Kimball

Cloudera, Inc.

Aaron Kimball is a software engineer at Cloudera, Inc., the commercial Hadoop company. He is the principal developer of Sqoop, the SQL-to-Hadoop database import/export tool. Aaron has worked with Hadoop since early 2007 and contributes actively to its development. Through Cloudera, he also provides training to developers and system administrators working with Hadoop. He holds a B.S. in Computer Science from Cornell University and an M.S. in Computer Science and Engineering from the University of Washington.

Comments on this page are now closed.


Ryan Boyer
07/19/2010 6:41am PDT


Will you please put up a link to the slides?

