Presented By O'Reilly and Cloudera
Make Data Work
December 1–3, 2015 • Singapore

Evolving from RDBMS to NoSQL + SQL

Jim Scott (MapR Technologies)
4:50pm–5:30pm Wednesday, 12/02/2015
Hadoop Platform
Location: 334-335 Level: Intermediate
Average rating: ****.
(4.00, 4 ratings)
Slides:   1-PPTX 

Prerequisite Knowledge

General understanding of relational databases


For the past 25 years applications have been getting built using an RDBMS with a predefined schema that forces data to conform with a schema on-write. Many people still think that they must use an RDBMS for applications, even though records in their datasets have no relation to one another. Additionally, those databases are optimized for transactional use, and data must be exported for analytics purposes. NoSQL technologies have turned that model on its side to deliver groundbreaking performance improvements.

I will walk through a music database with over 100 tables in the schema and show how to convert that model for use with a NoSQL database. I will show how to handle creating, updating, and deleting records, using column families for different types of data (and why).

I will then show how to use the exact same data without moving or transforming it to perform analytics, by leveraging Apache Drill’s ANSI-SQL capabilities on the NoSQL database.

Photo of Jim Scott

Jim Scott

MapR Technologies

Jim Scott is the director of enterprise strategy and architecture at MapR Technologies. Across his career, Jim has held positions running operations, engineering, architecture, and QA teams in the consumer packaged goods, digital advertising, digital mapping, chemical, and pharmaceutical industries. Jim has built systems that handle more than 50 billion transactions per day, and his work with high-throughput computing at Dow Chemical was a precursor to more standardized big data concepts like Hadoop. Jim is also the cofounder of the Chicago Hadoop Users Group (CHUG).