Make Data Work
Oct 15–17, 2014 • New York, NY

From Oracle to Hadoop

Guy Harrison (Dell Software), David Robson (Dell Software), Kathleen Ting (Cloudera)
1:45pm–2:25pm Thursday, 10/16/2014
Hadoop Platform
Location: Hall A 23/24
Average rating: 3.71 (7 ratings)

From Oracle to Hadoop: Unlocking Hadoop for Your RDBMS with Apache Sqoop and Other Tools

Apache Hadoop is great for storing large amounts of unstructured data, but when analyzing this data, users often need to reference data held in existing RDBMS-based systems. We’ll look at how to transfer large volumes of data from Oracle into Hadoop efficiently and scalably. We’ll also look at strategies for keeping this data up to date while placing minimal load on existing systems. In addition, we will look at strategies for Hadoop-to-RDBMS data flows, such as moving aggregated data from Hadoop to an RDBMS, and consider how Hadoop may be used alongside an RDBMS as a long-term archive or as a long-term transaction or audit log. We will discuss the new features of Apache Sqoop 2.0 and the merging of the Dell/Quest connector for Oracle into the Sqoop core, which gives Sqoop much improved scalability and manageability.

Guy Harrison

Dell Software

Guy Harrison is Executive Director of Research and Development at Dell Software. Guy is the author of Oracle Performance Survival Guide (Prentice Hall, 2009) and MySQL Stored Procedure Programming (O’Reilly, with Steven Feuerstein), as well as other books, articles, and presentations on database technology. He also writes a monthly column for Database Trends and Applications and is contributing to the upcoming Oracle Exadata Expert Handbook (Pearson, 2014).

Guy can be found on Twitter as @guyharrison.

David Robson

Dell Software

David Robson is a principal technologist at Dell Software. He is the lead developer of the Dell Oracle connector for Hadoop, which is currently being donated to the Apache Sqoop project, and was the originator of Dell’s Toad for Hadoop project. David has a background in Oracle administration and development as well as Java and Hadoop development. David lives in Melbourne, Australia.

Kathleen Ting

Cloudera

Kathleen Ting (@kate_ting) is currently a technical account manager at Cloudera where she helps strategic customers deploy and use the Hadoop ecosystem in production. She’s a frequent conference speaker, has contributed to several projects in the open source community, and is a committer and PMC member on Sqoop. Kathleen is also a co-author of O’Reilly’s Apache Sqoop Cookbook.

Comments on this page are now closed.


Guy Harrison
10/16/2014 10:52am EDT

Slides (and maybe video) will be on the Strata site soon, but right now you can get them from

Justin Ellsworth
10/16/2014 10:33am EDT

I enjoyed your presentation greatly. Can you please post the slides for us to access?

Jani Syed
10/16/2014 9:55am EDT

Hi, can you post the slides, maybe on slideshare?