Make Data Work
Oct 15–17, 2014 • New York, NY

The Data Lake Dream

Edd Wilder-James (Google)
9:50am–10:10am Wednesday, 10/15/2014
Data-Driven Business Day
Location: E 20/ E 21
Slides:   1-PPTX 

You might be at the exploratory stage with Hadoop, or you may have run multiple pilots and now be looking to institutionalize the technology. Either way, it’s important to grasp where big data technologies will fit in the long term.

This vision has been called the “data lake”. The data lake dream has been adopted by industry vendors, but what does it mean, and where is it headed?

This talk will show where the new big data technologies fit in the real world, with its cost and legacy constraints. We’ll cover how you can plot a pragmatic path to realizing the benefits big data architectures offer: breaking down data silos, making it quicker to develop new applications, and ensuring data is delivered to the people who need it.

I’ll describe the four steps towards realizing the benefits of the scale-out data architecture being ushered in by Hadoop and its ecosystem technologies.

  • Life Before Hadoop
  • Hadoop is Introduced
  • Growth of the Data Lake
  • A Data Lake and Application Cloud

These steps will help you understand the maturity of both your own applications and those offered by data technology vendors.

This talk is ideal for anyone managing the development and architecture of enterprise data systems, and those seeking to understand the direction being taken by the main players in big data technology.

Edd Wilder-James


Edd Dumbill is a technology analyst, writer and entrepreneur based in California. He’s helping drive businesses with data as VP Strategy for Silicon Valley Data Science.

Edd was the founding program chair for O’Reilly Strata and chaired the Open Source Convention for six years. He was also the founding editor of the journal Big Data.

A startup veteran, Edd was the founder and creator of the Expectnation conference management system, and a co-founder of the online intellectual property exchange.

An advocate of and contributor to open source software, Edd has contributed to various projects, such as Debian and GNOME, and created the DOAP Vocabulary for describing software projects.

Edd has written four books, including O’Reilly’s “Learning Rails”. He writes regularly on Google+ and on his blog at