Presented By
O’Reilly + Cloudera
Make Data Work
March 25-28, 2019
San Francisco, CA

Cross-cloud model training and serving with Kubeflow

Holden Karau (Google), Francesca Lazzeri (Microsoft), Trevor Grant (IBM)
1:30pm5:00pm Tuesday, March 26, 2019
Secondary topics:  AI and Data technologies in the cloud, Model lifecycle management
Average rating: ***..
(3.00, 2 ratings)

Who is this presentation for?

  • Data scientists and data engineers looking to move their models into production



Prerequisite knowledge

  • Familiarity with ML and Python
  • A working knowledge of the shell environment

Materials or downloads needed in advance

  • A WiFi-enabled laptop (no firewall—SSH access, etc. required)

What you'll learn

  • Understand how to train and deploy models with Kubeflow across different cloud vendors


Holden Karau, Francesca Lazzeri, and Trevor Grant offer an overview of Kubeflow and walk you through using it to train and serve models across different cloud environments (and on-premises). You’ll use a script to do the initial setup work, so you can jump (almost) straight into training a model on one cloud and then look at how to set up serving in another cluster/cloud.

The first part of the session will involve training a model on either Google Kubernetes Engine or on your own laptop using Minikube (as desired); in the second part, you’ll take the trained model and deploy it to your choice of Google’s, Amazon’s, Microsoft’s, or IBM’s cloud and make it publicly accessible to real traffic.

To keep the course simple, you’ll focus on training on a simple mode. If you speed through everything, you can either keep deploying more more clouds (gotta catch ‘em all) or try training a more complex, more realistic model doing feature preprocessing (like GitHub issue classification).

Note: Accounts will be provided for Google’s and Microsoft’s cloud, but users of other clouds will have to use their own accounts.

Photo of Holden Karau

Holden Karau


Holden Karau is a transgender Canadian open source developer advocate at Google focusing on Apache Spark, Beam, and related big data tools. Previously, she worked at IBM, Alpine, Databricks, Google (yes, this is her second time), Foursquare, and Amazon. Holden is the coauthor of Learning Spark, High Performance Spark, and another Spark book that’s a bit more out of date. She is a committer on the Apache Spark, SystemML, and Mahout projects. When not in San Francisco, Holden speaks internationally about different big data technologies (mostly Spark). She was tricked into the world of big data while trying to improve search and recommendation systems and has long since forgotten her original goal. Outside of work, she enjoys playing with fire, riding scooters, and dancing.

Photo of Francesca Lazzeri

Francesca Lazzeri


Francesca Lazzeri is Senior Machine Learning Scientist on the cloud developer advocacy team at Microsoft. Francesca has multiple years of experience as a data scientist and data-driven business strategy expert; she is passionate about innovations in big data technologies and the applications of machine learning–based solutions to real-world problems. Her work on these issues covers a wide range of industries, including energy, oil and gas, retail, aerospace, healthcare, and professional services. Previously, she was a research fellow in business economics at Harvard Business School, where she performed statistical and econometric analysis within the Technology and Operations Management Unit and worked on multiple patent data-driven projects to investigate and measure the impact of external knowledge networks on companies’ competitiveness and innovation. Francesca is currently a mentor for PhD and postdoc students at the Massachusetts Institute of Technology and enjoys speaking at academic and industry conferences to share her knowledge and passion for AI, machine learning, and coding.

Photo of Trevor Grant

Trevor Grant


Trevor Grant is PMC Member of the Apache Mahout and Apache Streams projects. He is a tinker extraordinaire and does a poor job of documenting his projects on He has an M.S. of Applied Math, a dog, a cat, an M.B.A., and a home in Chicago. He speaks a fair amount at locations internationally, and in general, his talks are usually pretty fun.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)