Presented By O'Reilly and Cloudera
Make Data Work
Sept 29–Oct 1, 2015 • New York, NY

Machine Learning 101

Alice Zheng (Amazon), Chris DuBois (Dato), Piotr Teterwak (Dato), Srikrishna Sridhar (Dato)
9:00am–5:00pm Tuesday, 09/29/2015
Data Science & Advanced Analytics
Location: 1 E6 / 1 E7 Level: Intermediate
Average rating: ***..
(3.63, 19 ratings)

Materials or downloads needed in advance

Please bring your laptop. For other prerequisites, please continue reading below.

Description

TUTORIAL PREREQUISITES

Welcome to Machine Learning 101! We will be learning the basics of machine learning by building real applications: recommender and image analysis with deep learning.

The tutorial landing page (https://dato.com/events/training/2015-strata-nyc.html) contains the agenda for the day as well as relevant material. Please follow the instructions for setting up your laptop PRIOR to arriving onsite.

If you run into any problems during installation, please email training@dato.com.

TUTORIAL DESCRIPTION

This hands-on tutorial provides a quick start to building intelligent business applications using machine learning. Learn about machine learning basics, feature engineering, recommender systems, and deep learning. We will build and deploy large-scale machine learning applications with Dato’s Machine Learning platform: GraphLab Create, Dato Distributed, and Dato Predictive Services. The program will center around building two applications: a content-based recommender that tells you which talks you might be interested in at Strata, and an image search application built using deep learning.

We will walk you through all the steps of prototyping and production: data cleaning, feature engineering, model building and evaluation, and deployment.

Please check back here prior to the tutorial date for installation instructions.

Topics:

  • Overview of machine learning
  • Feature engineering
  • Personalized recommenders
  • Content-based analysis
  • Factorization models
  • Building StrataNow—a personalized talk recommender for StrataConf
  • Deep learning
  • Deep learning models
  • Transfer learning and deep features
  • Building an image search application to find similar clothing
  • Deploying machine learning models in production
  • Constructing a real-time predictive service
  • Monitoring and evaluation
Photo of Alice Zheng

Alice Zheng

Amazon

Alice Zheng leads the machine learning optimization team on Amazon’s advertising platform. She specializes in research and development of machine learning methods, tools, and applications. Outside of work, she is writing a book, Mastering Feature Engineering. Previously, Alice worked at GraphLab/Dato/Turi, where she led the machine learning toolkits team and spearheaded user outreach. Prior to joining GraphLab, she was a researcher in the Machine Learning group at Microsoft Research, Redmond. Alice holds PhD and BA degrees in computer science and a BA in mathematics, all from UC Berkeley.

Photo of Chris DuBois

Chris DuBois

Dato

Chris DuBois is a data scientist focused on building tools for other data scientists. At Dato, Chris has helped design and implement tools for creating recommendation systems and for large-scale text analysis. His current work makes it simpler to train models that generalize well. After studying applied mathematics at Pomona College, he earned a PhD in statistics from the University of California, Irvine, where he researched latent variable models for social-network data occurring over time.

Photo of Piotr Teterwak

Piotr Teterwak

Dato

Piotr Teterwak works on the toolkit development team at Dato. He received a BA in computer science from Dartmouth College, where he conducted work exploring the learning of convolutional deep neural nets with applications in computer vision.

Photo of Srikrishna Sridhar

Srikrishna Sridhar

Dato

Krishna Sridhar is a data scientist at Dato. He holds a PhD in computer science from the University of Wisconsin-Madison, where he worked on high-performance software for large-scale problems in mathematical optimization and data analysis. Krishna’s work has been used in applications such as healthcare, industrial production planning, and machine learning.

Comments on this page are now closed.

Comments

Picture of Alice Zheng
Alice Zheng
09/22/2015 10:26pm EDT

Hi Celso,

The material/instructions can be found here: https://dato.com/events/training/2015-strata-nyc.html

This page will be updated soon with the link above.

Let us know if you run into any problems with the setup!

Celso Poderoso
09/22/2015 5:25am EDT

Hi, when materials and/or downloads will be available?
Thanks.