Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

Architecting a data platform

John Akred (Silicon Valley Data Science), Stephen O'Sullivan (Silicon Valley Data Science)
9:00am–12:30pm Tuesday, September 26, 2017
Data Engineering & Architecture, Spark & beyond
Location: 1E 12/13 Level: Intermediate
Secondary topics: Architecture
Average rating: 3.27 (11 ratings)

Who is this presentation for?

  • Technical people and leaders in the data management arena

Prerequisite knowledge

  • A basic knowledge of Hadoop and Spark

Materials or downloads needed in advance

  • A laptop (You'll be provided a GitHub link for sample code.)

What you'll learn

  • Understand how the various parts of the Hadoop and big data ecosystem fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads


What are the essential components of a data platform? John Akred and Stephen O’Sullivan explain how the various parts of the Hadoop, Spark, and big data ecosystems fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads.

By tracing the flow of data from source to output, John and Stephen explore the options and considerations for components, including acquisition from internal and external data sources, ingestion (offline and real-time processing), storage, analytics (batch and interactive), and providing data services (exposing data to applications). They’ll also give advice on tool selection, the function of the major Hadoop components and other big data technologies such as Spark and Kafka, and integration with legacy systems.

John Akred

Silicon Valley Data Science

With over 15 years in advanced analytical applications and architecture, John Akred is dedicated to helping organizations become more data-driven. As CTO of Silicon Valley Data Science, John combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

Stephen O'Sullivan

Silicon Valley Data Science

A leading expert on big data architecture and Hadoop, Stephen O’Sullivan has 20 years of experience creating scalable, high-availability data and applications solutions. A veteran of @WalmartLabs, Sun, and Yahoo, Stephen leads data architecture and infrastructure at Silicon Valley Data Science.

Comments on this page are now closed.


John Akred | CTO
09/26/2017 8:01am EDT

Hi Mohammed, sorry you missed us. At this link you can get the slides sent to you:

Mohammed Ayub | DATA SCIENTIST
09/26/2017 7:34am EDT

Unfortunately, will miss this session due to conflict. Can we get access to tutorial material?