Sep 23–26, 2019
Please log in

Professional Kafka development

Jesse Anderson (Big Data Institute)
9:00am—5:00pm Monday, September 23—Tuesday, September 24
Location: 1E 06
Average rating: *****
(5.00, 3 ratings)

Participants should plan to attend both days of training course. Note: to attend training courses, you must be registered for a Platinum or Training pass; does not include access to tutorials on Tuesday.

Jesse Anderson offers you an in-depth look at Apache Kafka. You'll learn how Kafka works and how to create real-time systems with it, as well as how to create consumers and publishers. You'll take a look Jesse then walks you through Kafka’s ecosystem, demonstrating how to use tools like Kafka Streams, Kafka Connect, and KSQL.

What you'll learn, and how you can apply it

  • Learn how to create large-scale real-time systems using Apache Kafka, how to use Apache Avro with Kafka, how to use Kafka Streams to create data pipelines, and how to configure Kafka Connect to put data into and take data out of Kafka
  • Understand how real-time distributed systems are different from batch systems and best practices and common architectural patterns when creating solutions with Kafka
  • Discover how to create Kafka producers and consumers
  • See how to use your existing SQL skills with KSQL

Who is this presentation for?

  • You're a software engineer, data engineer, or data scientist.

Level

Intermediate

Prerequisites:

  • A working knowledge of Java
  • Familiarity with big data technologies (useful but not required)

Hardware and/or installation requirements:

  • A laptop (at least 3 GB of free RAM, 10 GB of free disk space, and a 64-bit processor with VT-X enabled) with VirtualBox installed

Outline

Day 1

Data at scale

  • Data movement concepts
  • Moving data at scale

Kafka concepts

  • Kafka system
  • Basic concepts
  • Advanced concepts

Developing with Kafka

  • Using Apache Maven
  • Kafka API
  • Kafka API caveats

Advanced Kafka development

  • Advanced consumers and producers
  • Advanced Offset Handling
  • Transactions
  • Multithreading consumers

Day 2

Kafka and Avro

  • Why serialize
  • Avro and serialization formats

Kafka Connect

  • Using Kafka Connect
  • Importing from JDBC
  • Exporting to HDFS

Kafka Streams

  • Kafka Streams
  • Kafka Streams API

KSQL

  • Using KSQL

About your instructor

Photo of Jesse Anderson

Jesse Anderson is a data engineer, creative engineer, and managing director of the Big Data Institute. Jesse trains employees on big data—including cutting-edge technology like Apache Kafka, Apache Hadoop, and Apache Spark. He’s taught thousands of students at companies ranging from startups to Fortune 100 companies the skills to become data engineers. He’s widely regarded as an expert in the field and recognized for his novel teaching practices. Jesse is published by O’Reilly and Pragmatic Programmers and has been covered in such prestigious media outlets as the Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired. You can learn more about Jesse at Jesse-Anderson.com.

Twitter for jessetanderson

Conference registration

Get the Platinum pass or the Training pass to add this course to your package.

Comments on this page are now closed.

Comments

pani manchella | Lead Data Architect
09/21/2019 1:53pm EDT

I have surface book, installed virtual box, downloaded the image. imported the image
VM is not starting.

VT-X is enabled by default and this option is not available in Bios.
Hyper-V is disabled in windows options.

any help is appreicated

Picture of Jesse Anderson
Jesse Anderson | Managing Director
09/09/2019 2:13pm EDT

@Harry, you will receive an email from O’Reilly with the information for the course. This will include a link to the VM image and other course contents. If you don’t receive this email before the class, please reach out to O’Reilly or check your spam folder.

Harry Butler | Senior Application Developer
09/09/2019 9:47am EDT

I have a question on the hardware requirements. How will we install the virtual environment onto Virtual Box? Will it be a thumb drive or will it be a download from a web-site? I have a U.S. government issued laptop and it will not allow any connections to a usb port. I can download a file from the internet if we have an internet connection. Thank you.

  • Cloudera
  • O'Reilly
  • Google Cloud
  • IBM
  • Cisco
  • Dataiku
  • Intel
  • Io-Tahoe
  • MemSQL
  • Microsoft Azure
  • Oracle Cloud Infrastructure
  • SAS
  • Arcadia Data
  • BMC Software
  • Hazelcast
  • SAP
  • Amazon Web Services
  • Anaconda
  • Esri
  • Infoworks.io, Inc.
  • Kyligence
  • Pitney Bowes
  • Talend
  • Google Cloud
  • Confluent
  • DataStax
  • Dremio
  • Immuta
  • Impetus Technologies Inc.
  • Keyence
  • Kyvos Insights
  • StreamSets
  • Striim
  • Syncsort
  • SK holdings C&C

    Contact us

    confreg@oreilly.com

    For conference registration information and customer service

    partners@oreilly.com

    For more information on community discounts and trade opportunities with O’Reilly conferences

    strataconf@oreilly.com

    For information on exhibiting or sponsoring a conference

    pr@oreilly.com

    For media/analyst press inquires