Presented By O’Reilly and Cloudera
Make Data Work
March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

Time series data: Architecture and use cases

Ted Malaska (Capital One)
1:30pm–5:00pm Tuesday, March 6, 2018
Secondary topics: Graphs and Time-series
Average rating: 2.80 (5 ratings)

Who is this presentation for?

  • Anyone working with data whose defining dimension is time

Prerequisite knowledge

  • A basic understanding of time series data models
  • A use case that involves time series data

What you'll learn

  • Explore all aspects of time series data

Description

The ongoing and steep increase in the number of internet-connected devices is inescapable, but traditional data processing pipelines are poorly equipped to deal with streaming data and other data whose defining dimension is time.

If your data has a time component, you need to think in terms of time series datasets. Ted Malaska explores time series in all of its forms, from tumbling windows to sessionization, in both batch and streaming. Ted dives into real-world use cases and shares patterns to help you get the most value from your datasets. You’ll gain exposure to the tools and background you need to be successful in the world of time-oriented data.
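
For readers new to the two terms named above, here is a minimal batch-style sketch in plain Python. It is not taken from the session materials: the function names `tumbling_windows` and `sessionize`, the integer timestamps, and the sample events are all assumptions made for illustration. A tumbling window assigns each event to exactly one fixed-width, non-overlapping bucket, while sessionization starts a new group whenever the gap between consecutive events exceeds a threshold.

```python
from collections import defaultdict


def tumbling_windows(events, width):
    """Group (timestamp, value) pairs into fixed, non-overlapping windows.

    Each event lands in the window whose start is the timestamp rounded
    down to a multiple of `width` (hypothetical helper, integer timestamps).
    """
    windows = defaultdict(list)
    for ts, value in events:
        window_start = (ts // width) * width
        windows[window_start].append(value)
    return dict(sorted(windows.items()))


def sessionize(timestamps, gap):
    """Split timestamps into sessions separated by more than `gap` of inactivity."""
    sessions = []
    for ts in sorted(timestamps):
        if sessions and ts - sessions[-1][-1] <= gap:
            # Close enough to the previous event: extend the current session.
            sessions[-1].append(ts)
        else:
            # First event, or the gap was exceeded: start a new session.
            sessions.append([ts])
    return sessions


# Made-up sample data: (timestamp in seconds, payload).
events = [(1, "a"), (4, "b"), (12, "c"), (13, "d"), (31, "e")]
by_window = tumbling_windows(events, width=10)
sessions = sessionize([ts for ts, _ in events], gap=5)
```

With this sample data, the 10-second tumbling windows start at 0, 10, and 30, and a 5-second inactivity gap yields three sessions. A streaming engine would apply the same grouping rules incrementally and would also have to handle late or out-of-order events, which this sketch sidesteps by sorting up front.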

Ted Malaska

Capital One

Ted Malaska is a director of enterprise architecture at Capital One. Previously, he was the director of engineering in the Global Insight Department at Blizzard; principal solutions architect at Cloudera, helping clients find success with the Hadoop ecosystem; and a lead architect at the Financial Industry Regulatory Authority (FINRA). He has contributed code to Apache Flume, Apache Avro, Apache YARN, Apache HDFS, Apache Spark, Apache Sqoop, and many more. Ted is a coauthor of Hadoop Application Architectures, a frequent speaker at many conferences, and a frequent blogger on data architectures.