Presented By
O’Reilly + Cloudera
Make Data Work
29 April–2 May 2019
London, UK

Professional Kafka development (Day 2)

Jesse Anderson (Big Data Institute)
Location: London Suite 2

Who is this presentation for?

  • You're a software engineer, data engineer, or data scientist.

Level

Intermediate

Prerequisite knowledge

  • A working knowledge of Java
  • Familiarity with big data technologies (useful but not required)

What you'll learn

  • Learn how to create large-scale real-time systems using Apache Kafka
  • Understand how real-time distributed systems are different from batch systems
  • Discover how to create Kafka producers and consumers
  • Learn how to use Apache Avro with Kafka for truly enterprise-grade solutions
  • Learn how to use Kafka Streams to create data pipelines
  • Learn how to configure Kafka Connect to put data into and take out of Kafka
  • See how to use your existing SQL skills with KSQL
  • Understand best practices and common architectural patterns when creating solutions with Kafka

Description

Outline

Day 1

Data at scale

  • Data movement concepts
  • Moving data at scale

Kafka concepts

  • Kafka system
  • Basic concepts
  • Advanced concepts

Developing with Kafka

  • Using Apache Maven
  • Kafka API
  • Kafka API caveats

Advanced Kafka development

  • Advanced consumers and producers
  • Advanced Offset Handling
  • Transactions
  • Multithreading consumers

Day 2

Kafka and Avro

  • Why serialize
  • Avro and serialization formats

Kafka Connect

  • Using Kafka Connect
  • Importing from JDBC
  • Exporting to HDFS

Kafka Streams

  • Kafka Streams
  • Kafka Streams API

KSQL

  • Using KSQL

Conclusion

Photo of Jesse Anderson

Jesse Anderson

Big Data Institute

Jesse Anderson is a data engineer, creative engineer, and managing director of the Big Data Institute. Jesse trains employees on big data—including cutting-edge technology like Apache Kafka, Apache Hadoop, and Apache Spark. He’s taught thousands of students at companies ranging from startups to Fortune 100 companies the skills to become data engineers. He’s widely regarded as an expert in the field and recognized for his novel teaching practices. Jesse is published by O’Reilly and Pragmatic Programmers and has been covered in such prestigious media outlets as the Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired. You can learn more about Jesse at Jesse-Anderson.com.