Presented By O’Reilly and Cloudera
Make Data Work
March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

2-day training courses

All training courses take place 9:00am - 5:00pm, Monday, March 5 through Tuesday, March 6. In order to maintain a high level of hands-on learning and instructor interaction, each training course is limited in size.

Participants should plan to attend both days of this 2-day training course. Training passes do not include access to tutorials on Tuesday.

Monday, March 5 - Tuesday, March 6

9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: 212 A-B
Brooke Wenig (Databricks)
Brooke Wenig walks you through the core APIs for using Spark, fundamental mechanisms and basic internals of the framework, SQL and other high-level data access tools, and Spark’s streaming capabilities and machine learning APIs.
9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: 114
Jesse Anderson (Big Data Institute)
To handle real-time big data, you need to solve two difficult problems: how do you ingest that much data and how will you process that much data? Jesse Anderson explores the latest real-time frameworks (both open source and managed cloud services), discusses the leading cloud providers, and explains how to choose the right one for your company.
9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: San Jose Ballroom (salon 1&2), Marriott
Robert Schroll (The Data Incubator)
The TensorFlow library enables the use of data flow graphs for numerical computations, with automatic parallelization across several CPUs or GPUs, making it ideal for implementing neural networks and other machine learning algorithms. Robert Schroll demonstrates TensorFlow's capabilities and walks you through building machine learning models on real-world data.
9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: 212 C
Brian Bloechle (Cloudera), Glynn Durham (Cloudera)
Average rating: 5.00 (1 rating)
Brian Bloechle demonstrates how to implement typical data science workflows using Apache Spark. You'll learn how to wrangle and explore data using Spark SQL DataFrames and how to build, evaluate, and tune machine learning models using Spark MLlib.
9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: 212 D
Angie Ma (Faculty), Maria Diaz (ASI Data Science)
Average rating: 4.00 (2 ratings)
Angie Ma offers a condensed introduction to key data science and machine learning concepts and techniques, showing you what is (and isn't) possible with these exciting new tools and how they can benefit your organization.
9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: 111
Delip Rao (AI Foundation), Brian McMahan (Wells Fargo)
Average rating: 5.00 (1 rating)
PyTorch is a recent deep learning framework from Facebook that is gaining massive momentum in the deep learning community. Its fundamentally flexible design makes building and debugging models straightforward, simple, and fun. Delip Rao and Brian McMahan walk you through PyTorch's capabilities and demonstrate how to use PyTorch to build deep learning models and apply them to real-world problems.
9:00am - 5:00pm Monday, March 5 & Tuesday, March 6
Location: Willow Glen (1&2), Marriott
Zachary Glassman (The Data Incubator)
Zachary Glassman demonstrates how to build intelligent business applications using machine learning, taking you through each step in developing a machine learning pipeline, from prototyping to production. You'll explore data cleaning, feature engineering, model building and evaluation, and deployment, and extend your knowledge by building two applications from real-world datasets.