Presented By O'Reilly and Cloudera
December 5-6, 2016: Training
December 6–8, 2016: Tutorials & Conference

A survey of time series analysis techniques for sensor data

Rajesh Sampathkumar (The Data Team)
1:45pm–2:25pm Thursday, December 8, 2016
Data science and advanced analytics
Location: 308/309 Level: Intermediate
Average rating: ****.
(4.00, 1 rating)

Prerequisite Knowledge

  • An understanding descriptive and inferential statistics
  • A cursory knowledge of basic time series analysis approaches
  • Familiarity with Python and experience doing data science on a Python stack (either in a distributed setting or a standalone machine)

What you'll learn

  • Understand the unique challenges that time series data from machine sensors poses
  • Gain an introduction to the key techniques and tools used in the analysis of such data
  • Learn the importance of the time series approaches and the differences from aggregate statistical data analysis approaches


One challenge when dealing with manufacturing sensor data analysis is to formulate an efficient model of the underlying physical system. Rajesh Sampathkumar shares his experience working with sensor data at scale to model a real-world manufacturing subsystem with simple techniques, such as moving average analysis, and advanced ones, like VAR, applied to the problem of predictive maintenance.

Rajesh begins by exploring product sensor data, using aggregate statistical methods that do not consider the time element of the data in the analysis. These methods are effective in certain classes of problems where the failure modes are well defined, despite their obvious deficiency of not incorporating the time dimension. Rajesh then explores classification- and regression-based machine-learning approaches respectively, according to the hypotheses set up from the data. After validating the scope and usefulness of aggregate statistical modeling approaches, Rajesh incorporates the time dimension into the analysis and discusses various relevant algorithms. Rajesh concludes by sharing practical experiences running through this gamut of options and outlining some best practices relevant for anybody else embarking on the same journey.

Photo of Rajesh Sampathkumar

Rajesh Sampathkumar

The Data Team

Rajesh Sampathkumar is senior consultant at the Data Team, a strategy consulting organization focused on big data, data analytics, and data science, where he works with clients in diverse industries to provide data science expertise relevant to their business and decision making. Rajesh has many years of experience in consulting, design, and engineering at a number of reputed organizations.