Building a Data Platform

John Akred (Silicon Valley Data Science), Richard Williamson (Silicon Valley Data Science), Stephen O'Sullivan (Data Whisperers)
Hadoop Platform Grand Ballroom West
Tutorial Please note: to attend, your registration must include Tutorials on Monday.
Average rating: ***..
(3.71, 17 ratings)

What are the essential components of a data platform? This tutorial will explain how the various parts of the Hadoop and big data ecosystem fit together in production to create a data platform supporting batch, interactive and real-time analytical workloads.

By tracing the flow of data from source to output, we’ll explore the options and considerations for components, including:

  • Acquisition: from internal and external data sources
  • Ingestion: offline and real-time processing
  • Storage
  • Providing data services: exposing data to applications
  • Analytics: batch and interactive
  • Data management: data security, lineage, metadata and quality

We’ll also give advice on:

  • tool selection
  • the function of the major Hadoop components and other big data technologies
  • hardware sizing and cloud provisioning
  • integration with legacy systems

John Akred

Silicon Valley Data Science

With over 15 years in advanced analytical applications and architecture, John is dedicated to helping organizations become more data-driven. He combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

Richard Williamson

Silicon Valley Data Science

Richard has been at the cutting edge of big data since its inception, leading multiple efforts to build multi-petabyte Hadoop platforms and maximizing business value by combining data science with big data. He has extensive experience creating advanced analytic systems using data warehousing and data mining technologies.

Stephen O'Sullivan

Data Whisperers

A leading expert on big data architecture and Hadoop, Stephen brings over 20 years of experience creating scalable, high-availability data and application solutions. A veteran of WalmartLabs, Sun and Yahoo!, Stephen leads data architecture and infrastructure.

Comments on this page are now closed.


soumya naik
10/25/2013 10:17pm EDT

Does this session require any pre-reads?

Stephen O'Sullivan
10/25/2013 3:35pm EDT

Prashant, no software needs to be installed.


10/25/2013 2:32pm EDT

Hi, in order to participate, do you want us to install any software beforehand? Thanks, Prashant

