Presented By
O’Reilly + Cloudera

Make Data Work

March 25-28, 2019
San Francisco, CA

Professional Kafka development

Jesse Anderson (Big Data Institute)

Monday, March 25 & Tuesday, March 26, 9:00am - 5:00pm

Data Engineering & Architecture
Location: 3016

Secondary topics: Streaming, realtime analytics, and IoT

Participants should plan to attend both days of this 2-day training course. To attend training courses, you must register for a Platinum or Training pass. Note that the training course does not include access to tutorials on Tuesday.

Jesse Anderson leads a deep dive into Apache Kafka. You'll learn how Kafka works and how to create real-time systems with it. You'll also discover how to create consumers and producers in Kafka and how to use Kafka Streams, Kafka Connect, and KSQL as you explore the Kafka ecosystem.

What you'll learn, and how you can apply it

Learn how to create large-scale real-time systems using Apache Kafka
Understand how real-time distributed systems are different from batch systems
Learn how to create Kafka producers and consumers
Discover how to use Apache Avro with Kafka for truly enterprise-grade solutions
Learn how to use Kafka Streams to create data pipelines
Understand how to configure Kafka Connect to move data into and out of Kafka
Find out how to use your existing SQL skills with KSQL
Explore best practices and common architectural patterns when creating solutions with Kafka

This training is for you because...

You're a software engineer, data engineer, or data scientist.

Prerequisites:

A working knowledge of programming in Java
A basic understanding of big data technologies (useful but not required)

Hardware and/or installation requirements:

A laptop with VirtualBox installed, at least 3 GB of free RAM, 10 GB of free disk space, and a 64-bit processor with VT-x enabled

Outline

Day 1

Data at scale

Data movement concepts
Moving data at scale

Kafka concepts

Kafka system
Basic concepts
Advanced concepts

Developing with Kafka

Using Apache Maven
Kafka APIs
Kafka API caveats

Advanced Kafka development

Advanced consumers and producers
Advanced offset handling
Transactions
Multithreading consumers

Day 2

Kafka and Avro

Why serialize
Avro and serialization formats

Kafka Connect

Using Kafka Connect
Importing from JDBC
Exporting to HDFS

Kafka Streams

Kafka Streams
The Kafka Streams API

KSQL

Using KSQL

Wrap-up and Q&A

About your instructor

Jesse Anderson is a data engineer, creative engineer, and managing director of the Big Data Institute. Jesse trains employees on big data, including cutting-edge technology like Apache Kafka, Apache Hadoop, and Apache Spark. He’s taught thousands of students, at companies ranging from startups to the Fortune 100, the skills to become data engineers. He’s widely regarded as an expert in the field and recognized for his novel teaching practices. Jesse is published by O’Reilly and Pragmatic Programmers and has been covered in such prestigious media outlets as the Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired. You can learn more about Jesse at Jesse-Anderson.com.

Conference registration

Get the Platinum pass or the Training pass to add this course to your package.

Comments

Jesse Anderson | MANAGING DIRECTOR

11/27/2018 1:21am PST

You’ll need an intermediate level of Java knowledge. The Kafka API won’t push your knowledge of Java. As long as you know how to program, you should be able to at least understand the code and system creation.

Diego Labrador | DATA ARCHITECT

11/27/2018 1:13am PST

Hi, can you please be more specific about the level of Java needed to really get the most out of this training?

Regards
