Mar 15–18, 2020

Professional Kafka development

Jesse Anderson (Big Data Institute)
9:00am—5:00pm
Sunday, March 15—Monday, March 16
Location: 232

Participants should plan to attend both days of training course. Note: to attend training courses, you must be registered for a Platinum or Training pass; does not include access to tutorials on Monday.

Jesse Anderson leads a deep dive into Apache Kafka. You'll learn how Kafka works and how to create real-time systems with it. You'll also discover how to create consumers and publishers in Kafka and how to use Kafka Streams, Kafka Connect, and KSQL as you explore the Kafka ecosystem.

What you'll learn, and how you can apply it

  • Learn how to create large-scale real-time systems using Apache Kafka
  • Understand how real-time distributed systems are different from batch systems
  • Learn how to create Kafka producers and consumers
  • Discover how to use Apache Avro with Kafka for truly enterprise-grade solutions
  • Learn how to use Kafka Streams to create data pipelines
  • Understand how to configure Kafka Connect to put data into and take out of Kafka
  • Find out how to use your existing SQL skills with KSQL
  • Explore best practices and common architectural patterns when creating solutions with Kafka

Who is this presentation for?

Data engineers, data architects, developers

Level

Beginner

Prerequisites:

  • A working knowledge of programming in Java
  • A basic understanding of big data technologies (useful but not required)

Hardware and/or installation requirements:

A laptop (at least 3 GB of free RAM, 10 GB of free disk space, and a 64-bit processor with VT-X enabled) with VirtualBox installed

Day 1

  • Data at scale
  • Data movement concepts
  • Moving data at scale
  • Kafka concepts
  • Kafka system
  • Basic concepts
  • Advanced concepts
  • Developing with Kafka
  • Using Apache Maven
  • Kafka APIs
  • Kafka API caveats
  • Advanced Kafka development
  • Advanced consumers and producers
  • Advanced offset handling
  • Transactions
  • Multithreading consumers

Day 2

  • Kafka and Avro
  • Why serialize
  • Avro and serialization formats
  • Kafka Connect
  • Using Kafka Connect
  • Importing from JDBC
  • Exporting to HDFS
  • Kafka Streams
  • Kafka Streams
  • The Kafka Streams API
  • KSQL
  • Using KSQL
  • Wrap-up and Q&A

About your instructor

Photo of Jesse Anderson

Jesse Anderson is a Big Data Engineering expert and trainer.

Conference registration

Get the Platinum pass or the Training pass to add this course to your package. Best Price ends January 10.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)

Contact us

confreg@oreilly.com

For conference registration information and customer service

partners@oreilly.com

For more information on community discounts and trade opportunities with O’Reilly conferences

Become a sponsor

For information on exhibiting or sponsoring a conference

pr@oreilly.com

For media/analyst press inquires