Building a Data Platform

John Akred (Silicon Valley Data Science), Richard Williamson (Silicon Valley Data Science), Stephen O'Sullivan (Data Whisperers)
Hadoop and Beyond
Ballroom F
Tutorial
Please note: to attend, your registration must include Tutorials on Tuesday.
Average rating: ***..
(3.27, 22 ratings)
Slides:   external link

Tutorial Prerequisites

There are no downloads or pre-reads required, but attendees are encouraged to come with questions!

Tutorial Description

What are the essential components of a data platform? This tutorial will explain how the various parts of the Hadoop and big data ecosystem fit together in production to create a data platform supporting batch, interactive and real-time analytical workloads.

By tracing the flow of data from source to output, we’ll explore the options and considerations for components, including:

  • Acquisition: from internal and external data sources
  • Ingestion: offline and real-time processing
  • Storage
  • Providing data services: exposing data to applications
  • Analytics: batch and interactive
  • Data management: data security, lineage, metadata and quality

We’ll also give advice on:

  • tool selection
  • the function of the major Hadoop components and other big data technologies
  • hardware sizing and cloud provisioning
  • integration with legacy systems

John Akred

CTO, Silicon Valley Data Science

With over 15 years in advanced analytical applications and architecture, John is dedicated to helping organizations become more data-driven. He combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

Richard Williamson

Principal Engineer, Silicon Valley Data Science

Richard has been at the cutting edge of big data since its inception, leading multiple efforts to build multi-petabyte Hadoop platforms, maximizing business value by combining data science with big data. He has extensive experience creating advanced analytic systems using data warehousing and data mining technologies.

Stephen O'Sullivan

Data Geek, Data Whisperers

A leading expert on big data architecture and Hadoop, Stephen brings over 20 years of experience creating scalable, high-availability data and application solutions. A veteran of WalmartLabs, Sun and Yahoo!, Stephen leads data architecture and infrastructure.

Comments on this page are now closed.


Leslie Weaver
02/11/2014 2:02am PST

Please disregard my earlier question. I signed up for the full video suite and it will be emailed to me when it is available. Great session and meets us right where we are.

Sophia DeMartini
02/11/2014 2:00am PST

Slide decks will be posted after talks have been concluded, if the speaker chooses to send us their slides.

Leslie Weaver
02/11/2014 1:51am PST

Same as others, would like the slide deck.

Srikant Dharwad
02/11/2014 1:43am PST

Can the tutorial slides be made available? Thanks

Kathy Yu
02/09/2014 8:21am PST

Hi Sandra & Megan – our customer service will be happy to help you change your tutorial choice. You can email us at, or we can make that change for you onsite at the registration desk.

megan yao
02/08/2014 10:54pm PST

same questions… i saw this one after the registration. =(

02/08/2014 11:59am PST

I would like to attend this tutorial, but I chose another one during registration. Is it possible?