Presented By O’Reilly and Cloudera
Make Data Work
21–22 May 2018: Training
22–24 May 2018: Tutorials & Conference
London, UK

Hands-on data science with Python (Day 2)

Zachary Glassman (The Data Incubator)
Location: Capital Suite 7

Zachary Glassman offers a foundation in building intelligent business applications using machine learning, walking you through all the steps of developing a machine learning pipeline, from prototyping to production. You’ll explore data cleaning, feature engineering, model building and evaluation, and deployment and extend these models into two applications using real-world datasets. All work will be done in Python.

Outline

Day 1: Anomaly detection

  • Data format and goal
  • Limitations of time series data
  • Detrending and seasonality
  • Windowing and local scores
  • Setting thresholds for classification
  • Online learning

Day 2: Recommendation engine

  • Overview of data and its wrangling
  • Item-item correlations and finding similar items
  • User similarity and predicting user ratings
  • Collaborative filtering
  • Evaluating model performance
Photo of Zachary Glassman

Zachary Glassman

The Data Incubator

Zachary Glassman is a data scientist in residence at the Data Incubator. Zachary has a passion for building data tools and teaching others to use Python. He studied physics and mathematics as an undergraduate at Pomona College and holds a master’s degree in atomic physics from the University of Maryland.