Sep 9–12, 2019

Introducing Kubeflow (with special guests TensorFlow and Apache Spark)

Holden Karau (Google), Trevor Grant (IBM)
1:45pm2:25pm Thursday, September 12, 2019
Location: 230 B
Secondary topics:  Machine Learning

Who is this presentation for?

  • Data scientists and data engineers




Data science, machine learning, and artificial intelligence have exploded in popularity in the last five years, but the nagging question of how to put models into production remains. Engineers are typically tasked to build one-off systems to serve predictions that must be maintained amid a quickly evolving backend serving space that evolved from single machine to custom clusters to “serverless” to Docker to Kubernetes.

Holden Karau and Trevor Grant present Kubeflow—an open source project that makes it easy for users to move models from laptop to ML rig to training cluster to deployment. Learn exactly what Kubeflow is, why scalability is so critical for training and model deployment, and more. You’ll leave able to deploy models written in Python’s scikit-learn, R, TensorFlow, Spark, and more. The magic of Kubernetes allows you to write models on your laptop, deploy to an ML rig, and then DevOps can move that model into production with all the bells and whistles, such as monitoring, A/B tests, multiarm bandits, and security.

Prerequisite knowledge

  • Familiarity with Kubernetes and TensorFlow (or other ML libs)

What you'll learn

  • Learn what Kubeflow is, modeling training workflows, and deploying models to production
Photo of Holden Karau

Holden Karau


Holden Karau is a transgender Canadian open source developer advocate at Google focusing on Apache Spark, Beam, and related big data tools. Previously, she worked at IBM, Alpine, Databricks, Google (yes, this is her second time), Foursquare, and Amazon. Holden is the coauthor of Learning Spark, High Performance Spark, and another Spark book that’s a bit more out of date. She’s a committer on the Apache Spark, SystemML, and Mahout projects. When not in San Francisco, Holden speaks internationally about different big data technologies (mostly Spark). She was tricked into the world of big data while trying to improve search and recommendation systems and has long since forgotten her original goal. Outside of work, she enjoys playing with fire, riding scooters, and dancing.

Photo of Trevor Grant

Trevor Grant


Trevor Grant is an open source technical evangelist at IBM, a committer on the Apache Mahout, and contributor on Apache Streams (incubating), Apache Zeppelin, and Apache Flink projects. In former roles he called himself a data scientist, but the term is so over used these days. He holds an MS in applied math and an MBA from Illinois State University. Trevor’s an organizer of the newly formed Chicago Apache Flink meetup and has presented at Flink Forward, ApacheCon, Apache Big Data, and other meetups nationwide. Trevor was a combat medic in Afghanistan in 2009 and wrote an award-winning undergraduate thesis between missions. He has a dog and a cat and a ’64 Ford and he loves them all very much.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)

Contact us

For conference registration information and customer service

For more information on community discounts and trade opportunities with O’Reilly conferences

Become a sponsor

For information on exhibiting or sponsoring a conference

Contact list

View a complete list of O'Reilly AI contacts