Christian Moscardi walks you through developing a machine learning pipeline, from prototyping to production, with the Jupyter platform, exploring data cleaning, feature engineering, model building and evaluation, and deployment in an industry-focused setting. Along the way, you’ll learn Jupyter best practices and the Jupyter settings and libraries that enable great visualizations.
Day 1: Recommendation engine
Overview of data and its wrangling
Item-item correlations and finding similar items
User similarity and predicting user ratings
Collaborative filtering
Evaluating model performance
Day 2: Anomaly detection
Data format and goal
Limitations of time series data
Detrending and seasonality
Windowing and local scores
Setting thresholds for classification
Online learning
Christian Moscardi is director of technology for the Data Incubator. Previously, Christian developed a CMS for food blogs, worked for Google, and researched and taught at Columbia. He organizes with BetaNYC, New York’s civic tech organization, and contributes to various civic data projects. His extracurricular activities include cooking, playing the piano, and exploring New York.
Get the Platinum pass to add this course to your package.
Comments on this page are now closed.
For exhibition and sponsorship opportunities, email jupytersponsorships@oreilly.com
For information on trade opportunities with JupyterCon, email partners@oreilly.com
View a complete list of JupyterCon contacts
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com
Comments
If we cover the agenda as planned it will be time well spent! Looking forward to the course and learning some practical tips and tricks in the machine learning workflows. Thanks in advance!