Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

How machine learning with open source tools helps everyone build better products

Michelle Casbon (Google)
2:55pm3:35pm Thursday, September 28, 2017
Data science & advanced analytics, Machine Learning & Data Science
Location: 1A 06/07 Level: Intermediate
Secondary topics:  Text
Average rating: ****.
(4.00, 4 ratings)

Who is this presentation for?

  • Engineers, data scientists, designers, and product managers

Prerequisite knowledge

  • Familiarity with system architectures, distributed computing tools, machine learning, and NLP (useful but not required)

What you'll learn

  • Explore Qordoba’s architecture for handling billions of localized strings in many different languages
  • Understand how to make your own products better with localization


Building products that feel native to every user, regardless of language, is the best way to establish a user base across the globe. To do this, a product needs to support a variety of locales. The challenge with supporting multiple locales is the maintenance and generation of localized strings, which are deeply integrated into many facets of a product.

Michelle Casbon explores the machine learning and natural language processing that enables Qordoba to generate high-quality translations in many different languages and describes the techniques Qordoba uses to provide continuous deployment of localized strings, live syncing across platforms (mobile, web, Photoshop, Sketch, Help Desk, etc.), content generation for any locale, and emotional response. Michelle also explores Qordoba’s architecture for handling billions of localized strings in many different languages, using Apache Spark and Apache PredictionIO (incubating) for natural language processing, Kubernetes and Docker for containerized deployment, scaling, and management, Apache Cassandra and MariaDB as a storage layer, and Scala and Akka as an orchestration layer.

Photo of Michelle Casbon

Michelle Casbon


Michelle Casbon is a senior engineer on the Google Cloud Platform developer relations team, where she focuses on open source contributions and community engagement for machine learning and big data tools. Michelle’s development experience spans more than a decade and has primarily focused on multilingual natural language processing, system architecture and integration, and continuous delivery pipelines for machine learning applications. Previously, she was a senior engineer and director of data science at several San Francisco-based startups, building and shipping machine learning products on distributed platforms using both AWS and GCP. She especially loves working with open source projects and is a contributor to Kubeflow. Michelle holds a master’s degree from the University of Cambridge.

Comments on this page are now closed.


Picture of Michelle Casbon
Michelle Casbon | SENIOR ENGINEER
10/04/2017 4:18pm EDT

Thanks, Bostjan – glad you were there! Slides have been uploaded & should appear here shortly.

10/04/2017 1:34am EDT

Dear Michelle, it was an excellent sesion. Can you please share the slides?