Presented By O'Reilly and Cloudera
Make Data Work
December 1–3, 2015 • Singapore

Scaling document data up (way up) while scaling complexity down

Ted Dunning (MapR)
2:20pm–3:00pm Wednesday, 12/02/2015
Location: 333
Average rating: 4.00 (1 rating)

If you can simplify designs, you can develop systems faster. SQL compatibility means that you can use existing tools, so you can avoid some development costs entirely. Increased scale means new business opportunities.

Flexible data model. SQL compatibility. Unlimited scale. Nearly all data systems make you pick at most two of the three, and many give you only one. You can now, however, have all three at once, with substantially lower complexity than you would expect.

I will show how a real-world relational database can be massively simplified using document structure, how that database can be queried using SQL, and how it can grow to the trillion-row, TB scale that modern applications require. Using open source software like Apache Drill, Apache Parquet, and the recently released OJAI project, you can replicate these results in your own systems.

This session is sponsored by MapR Technologies

Ted Dunning


Ted Dunning has been involved with a number of startups, the latest being MapR Technologies, where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache ZooKeeper and Mahout projects. Opinionated about software and data mining and passionate about open source, he is an active participant in the Hadoop and related communities and loves helping projects get going with new technologies.