Presented By O'Reilly and Cloudera
December 5–6, 2016: Training
December 6–8, 2016: Tutorials & Conference
Singapore

Robust stream processing with Apache Flink

Aljoscha Krettek (data Artisans)
4:15pm–4:55pm Thursday, December 8, 2016
IoT and intelligent real-time applications
Location: 308/309
Level: Intermediate
Tags: streaming
Average rating: 5.00 (3 ratings)

What you'll learn

  • Learn what it takes to do correct, robust, stateful computation over data streams in production
  • Gain exposure to Apache Flink's unique feature set and get a feel for what it's like to develop an application with Apache Flink

Description

Aljoscha Krettek offers a very short introduction to stream processing before diving into writing code and demonstrating the Apache Flink features that make truly robust stream processing possible, with a focus on correctness.

All of this will be done in the context of a real-time analytics application that we’ll modify on the fly as we work through each topic. Along the way, Aljoscha exercises Flink’s unique features, demonstrates fault recovery, explains why event time is such an important concept in robust, stateful stream processing, and covers the features a stream processor needs to support that kind of processing in production.

We’ll also use a real-time analytics dashboard to visualize the results we’re computing in real time, allowing us to easily see the effects of the code we’re developing as we go along.

Topics include:

  • Apache Flink
  • Stateful stream processing
  • Event time versus processing time
  • Fault tolerance
  • State management in the face of faults
  • Savepoints
  • Data reprocessing
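
To give a feel for the "event time versus processing time" topic, here is a minimal sketch in plain Python. It is not Flink code and is not taken from the session; the sample events, the 10-second window size, and the helper names (`window_start`, `count_per_window`) are assumptions made up for illustration. It counts the same four events per tumbling window twice, once by the time each event happened and once by the time it arrived, and shows that a single delayed event is enough to change the result.

```python
from collections import Counter

# Hypothetical events as (event_time_seconds, arrival_time_seconds).
# The second event happened at t=4 but was delayed and arrived at t=12.
EVENTS = [
    (1, 1),
    (4, 12),
    (11, 11),
    (13, 13),
]

WINDOW_SIZE = 10  # tumbling windows of 10 seconds (assumed for this sketch)


def window_start(timestamp, size=WINDOW_SIZE):
    """Return the start of the tumbling window that contains timestamp."""
    return timestamp - (timestamp % size)


def count_per_window(events, use_event_time):
    """Count events per tumbling window, keyed by window start.

    With use_event_time=True the window is chosen by when the event
    happened; otherwise it is chosen by when the event arrived.
    """
    counts = Counter()
    for event_time, arrival_time in events:
        ts = event_time if use_event_time else arrival_time
        counts[window_start(ts)] += 1
    return dict(counts)


event_time_counts = count_per_window(EVENTS, use_event_time=True)
processing_time_counts = count_per_window(EVENTS, use_event_time=False)

print("event time:     ", event_time_counts)
print("processing time:", processing_time_counts)
```

Grouping by event time places the delayed event in the window where it actually occurred, so the counts do not depend on arrival order. Grouping by arrival time shifts it into the later window. A real stream processor additionally has to decide how long to wait for late events, which this sketch does not model.
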

Aljoscha Krettek

data Artisans

Aljoscha Krettek is a PMC member at Apache Flink, where he mainly works on the Streaming API and also designed and implemented the most recent additions to the windowing and state APIs. Aljoscha is a cofounder and software engineer at data Artisans. Previously, he worked at IBM Germany and at the IBM Almaden Research Center in San Jose. Aljoscha has spoken at Hadoop Summit, Flink Forward, and several meetups about stream processing and Apache Flink. He studied computer science at TU Berlin.