Presented By O’Reilly and Cloudera
Make Data Work
September 11, 2018: Training & Tutorials
September 12–13, 2018: Keynotes & Sessions
New York, NY

In-Person Training
Hands-on data science with Python

Zachary Glassman (The Data Incubator)
9:00am–5:00pm Tuesday, 09/11/2018
Location: 1E 17

To attend a training course, you must be registered for a Platinum or Training pass. Training courses do not include access to tutorials on Tuesday.

Zachary Glassman leads a hands-on dive into building intelligent business applications using machine learning, walking you through all the steps of developing a machine learning pipeline. You'll explore data cleaning, feature engineering, model building and evaluation, and deployment, then extend these models into two applications using a real-world dataset.

Prerequisites:

  • A working knowledge of Python
  • Familiarity with pandas (useful but not required)

The Data Incubator offers a foundation in building intelligent business applications using machine learning. Zachary Glassman walks you through all the steps of developing a machine learning pipeline. You’ll explore data cleaning, feature engineering, model building and evaluation, and deployment, then extend these models into two applications using a real-world dataset. All work will be done in Python.

Outline

Anomaly detection

  • Data format and goal
  • Pandas basics
  • Limitations of time series data
  • Detrending and seasonality
  • Windowing and local scores
  • Setting thresholds for classification
  • Online learning
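
As a rough illustration of the windowing, local scoring, and thresholding topics in the outline above, here is a minimal sketch of a trailing-window z-score detector in pandas. It is not taken from the course materials: the function name `rolling_zscore_anomalies`, the window of 24 samples, the threshold of 3.0, and the synthetic data are all assumptions made for this example.

```python
import numpy as np
import pandas as pd


def rolling_zscore_anomalies(series, window=24, threshold=3.0):
    """Flag points that sit far from the mean of the preceding window.

    Hypothetical helper for illustration; `window` and `threshold`
    are assumed defaults, not values from the course.
    """
    # Use only the previous `window` points (shift by one) so that a
    # spike cannot inflate its own baseline.
    prior = series.shift(1).rolling(window, min_periods=window)
    local_mean = prior.mean()
    local_std = prior.std()

    # Local score: distance from the trailing mean, in standard deviations.
    score = (series - local_mean) / local_std

    # Thresholding turns the continuous score into a yes/no classification.
    # Rows without a full window have a NaN score and are never flagged.
    is_anomaly = score.abs() > threshold
    return pd.DataFrame(
        {"value": series, "score": score, "is_anomaly": is_anomaly}
    )


# Synthetic data (assumed): linear trend + 24-step cycle + noise,
# with a single spike injected at position 200.
rng = np.random.default_rng(0)
t = np.arange(240)
values = 0.05 * t + 2.0 * np.sin(2 * np.pi * t / 24) + rng.normal(0, 0.3, t.size)
values[200] += 10.0

result = rolling_zscore_anomalies(pd.Series(values))
print(result[result["is_anomaly"]])
```

Because the window spans one full cycle, the seasonal swing is absorbed into the local standard deviation rather than flagged. Explicit detrending or seasonal adjustment before scoring would be a natural refinement.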

About your instructor

Zachary Glassman is a data scientist in residence at The Data Incubator. Zachary has a passion for building data tools and teaching others to use Python. He studied physics and mathematics as an undergraduate at Pomona College and holds a master’s degree in atomic physics from the University of Maryland.

Conference registration

Get the Platinum pass or the Training pass to add this course to your package.

Comments

Zachary Glassman | DATA SCIENTIST IN RESIDENCE
09/07/2018 10:50am EDT

Hi Suvrat,
I would recommend familiarizing yourself with pandas, NumPy, and Matplotlib. We will be giving each person a cloud server for the duration of the course, so the only thing you will need installed on your laptop is a web browser.

Zach

Suvrat Bansal | CHIEF DATA OFFICER
09/07/2018 10:35am EDT

Zach, are there any prerequisites (software, libraries, etc.) we need to be aware of in preparation for the course?