Don Fox walks you through developing a machine learning pipeline, from prototyping to production. You’ll learn about data cleaning, feature engineering, model building and evaluation, and deployment and then extend these models into two applications from real-world datasets. All work will be done in Python.
Day 1: Anomaly detection
Day 2: Recommendation engine
Don Fox is a Boston-based data scientist in residence at the Data Incubator. Previously, Don developed numerical models for a geothermal energy startup. Born and raised in South Texas, Don holds a PhD in chemical engineering, where he researched renewable energy systems and developed computational tools to analyze the performance of these systems.
Get the Platinum pass or the Training pass to add this course to your package.
Comments on this page are now closed.
For exhibition and sponsorship opportunities, email strataconf@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata Data Conference contacts
©2019, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com
Comments
Hi Travis Craven,
No need to install anything before hand. Every participant will get access to a remote account where we host our training curriculum and environment. On these accounts, we will be using Jupyter Notebook; you may want to install it on your personal machine if you want to get familiar with it.
Is there a preferred IDE to install before the training?
Hi Jagdish,
1. Make sure you are comfortable with Python. I might spend some 30-60 minutes in the beginning to go over some elementary Python.
2. regarding data sets, it would help if you are familiar with pandas, the better. It is the power package for data wrangling and manipulation. Take a look at https://pandas.pydata.org/pandas-docs/stable/getting_started/10min.html to easily get started with Pandas
Hi Don, We are planing to do some prep work before the training. can you suggest an documentation or work on some data set to prepare for the training.
thanks
Jagdish