Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Architecting a data platform

John Akred (Silicon Valley Data Science), Stephen O'Sullivan (Silicon Valley Data Science)
9:00–12:30 Tuesday, 23 May 2017
Level: Intermediate
Average rating: 3.64 (14 ratings)

Who is this presentation for?

  • Technical people and leaders in the data management arena

Prerequisite knowledge

  • A basic knowledge of Hadoop and Spark

Materials or downloads needed in advance

  • A laptop (you'll be provided a GitHub link for sample code)

What you'll learn

  • Understand how the various parts of the Hadoop and big data ecosystem fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads


What are the essential components of a data platform? John Akred and Stephen O’Sullivan explain how the various parts of the Hadoop, Spark, and big data ecosystems fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads.

By tracing the flow of data from source to output, John and Stephen explore the options and considerations for components, including acquisition from internal and external data sources, ingestion (offline and real-time processing), storage, analytics (batch and interactive), and providing data services (exposing data to applications). They’ll also give advice on tool selection, the function of the major Hadoop components and other big data technologies such as Spark and Kafka, and integration with legacy systems.

John Akred

Silicon Valley Data Science

With over 15 years in advanced analytical applications and architecture, John Akred is dedicated to helping organizations become more data driven. As CTO of Silicon Valley Data Science, John combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

Stephen O'Sullivan

Silicon Valley Data Science

A leading expert on big data architecture and Hadoop, Stephen O’Sullivan has 20 years of experience creating scalable, high-availability data and application solutions. A veteran of @WalmartLabs, Sun, and Yahoo, Stephen leads data architecture and infrastructure at Silicon Valley Data Science.

Leave a Comment or Question


Stephen O'Sullivan | VP OF ENGINEERING
25/05/2017 11:31 BST

Hi Alexandre, you can get the slides here


Alexandre Berger
25/05/2017 10:23 BST

Hello John & Stephen, nice presentation. Are you planning to share the slides?