You might be at the exploratory stage with Hadoop, or you may have run multiple pilots and are now looking to institutionalize the technology—it’s important to grasp where big data technologies will integrate in the long term.
This vision has been called the “data lake”. The data lake dream has been adopted by industry vendors, but what does it mean, and where is it headed?
In the real world, with cost and legacy considerations, this talk will show where the new big data technologies fit in. We’ll cover how you can plot a pragmatic path to realizing the benefits big data architectures offer: breaking down data silos, making it quicker to develop new applications, and ensuring data is delivered to the people who need it.
I’ll describe the four steps towards realizing the benefits of the scale-out data architecture being ushered in by Hadoop and its ecosystem technologies.
These steps will help you understand both the maturity of your own applications, and those offered by data technology vendors.
This talk is ideal for anyone managing the development and architecture of enterprise data systems, and those seeking to understand the direction being taken by the main players in big data technology.
Edd Dumbill is a technology analyst, writer and entrepreneur based in California. He’s helping drive businesses with data as VP Strategy for Silicon Valley Data Science.
A startup veteran, Edd was the founder and creator of the Expectnation conference management system, and a co-founder of the Pharmalicensing.com online intellectual property exchange.
An advocate and contributor to open source software, Edd has contributed to various projects, such as Debian and GNOME, and created the DOAP Vocabulary for describing software projects.