Presented By
O’Reilly + Cloudera
Make Data Work
March 25-28, 2019
San Francisco, CA

Introduction to Flink via Flink SQL

Fabian Hueske (Ververica)
9:00am-12:30pm Tuesday, March 26, 2019
Secondary topics: Streaming, realtime analytics, and IoT

Who is this presentation for?

Data Engineers

Level

Beginner

Prerequisite knowledge

* Basics of SQL
* Basics of stream processing

Materials or downloads needed in advance

You need a laptop with Docker installed and, optionally, an IDE such as IntelliJ.

What you'll learn

* Running SQL on streaming data

Description

As data processing becomes more real-time, stream processing is becoming more important. Apache Flink makes it easier to build and manage stream processing applications. Flink’s new SQL interface is a great way not only to get started with Flink but also to build and maintain production applications.

This hands-on session introduces Flink through its SQL interface. You will get an overview of stream processing and a survey of Apache Flink and its various modes of use. We’ll then use Flink to run SQL queries on data streams and contrast this approach with Flink’s DataStream API.

Agenda:

Talk: Survey of Apache Flink and its interfaces
Talk: Intro to SQL on Flink
Unified API for Batch and Streaming
Executing SQL queries on Flink
Walk through the documentation

Hands-On: Setting up the SQL CLI Client
Running the first queries
Talk: SQL on Data Streams
Tables, Streams, and Materialized Views
Supported Operations
Event Time and Processing Time

Hands-On: Running queries on data streams
Windowed Queries
Event time queries
Processing time queries (considering cutting)
Materializing Queries (considering cutting)

Talk: Flink APIs, Internals, Connectors & UDFs
Table API and SQL
DataStream API
Hands-On: Working with the DataStream API

Fabian Hueske

Ververica

Fabian Hueske is a committer and PMC member of the Apache Flink project. He was one of the three original authors of the Stratosphere research system, from which Apache Flink was forked in 2014. Fabian is a co-founder of Ververica, a Berlin-based startup devoted to fostering Flink, where he works as a software engineer and contributes to Apache Flink. He holds a PhD in computer science from TU Berlin and is currently spending a lot of his time writing a book, Stream Processing with Apache Flink.