Attendees should have basic familiarity with the Linux command line and Java or Python, although sample solutions will also be provided for those who are not developers. No prior knowledge of Kafka is required.
The tutorial includes some hands-on exercises. If you want to follow along, you'll need a laptop with at least 4 GB of RAM and VirtualBox installed. Once you have installed VirtualBox, please download the virtual machine and Exercise Manual. Note that your laptop must be capable of running a 64-bit guest virtual machine; the easiest way to test this is to download the VM, launch it (double-click the .vbox file), and ensure it starts up. If it does not start properly, check your machine’s BIOS and ensure that VT-x is enabled.
Ian Wrigley leads a hands-on workshop on leveraging the capabilities of Apache Kafka to collect, manage, and process stream data for both big data projects and general-purpose enterprise data integration, covering key architectural concepts, developer APIs, use cases, and how to write applications that publish data to, and subscribe to data from, Kafka. Ian offers an overview of Kafka, explains how it works, and demonstrates how use it to build modern data applications, using hands-on exercises where you’ll build an application that can to publish data to Kafka and subscribe to receive data from Kafka. This tutorial is ideal for application developers, ETL (extract, transform, load) developers, or data scientists who need to interact with Kafka clusters as a source of, or destination for, stream data.
Ian Wrigley is a Technical Director at StreamSets, the company behind the industry’s first data operations platform. Over his 25-year career, Ian has taught tens of thousands of students subjects ranging from C programming to Hadoop development and administration.
Comments on this page are now closed.
©2016, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.