Presented By O'Reilly and Cloudera
Make Data Work
31 May–1 June 2016: Training
1 June–3 June 2016: Conference
London, UK

Introduction to Apache Spark for Java and Scala developers

Ted Malaska (Capital One), Kai Voigt (Cloudera)
14:05–14:45 Friday, 3 June 2016
Spark & beyond
Location: Capital Suite 13
Level: Intermediate
Average rating: 3.43 (7 ratings)

Prerequisite knowledge

Attendees should have a background in Java or Scala.

Description

With the rapid adoption of Apache Spark—one of the most active Apache projects today—and the need for programs to span many machines to solve the world’s greatest problems, distributed computing has resurfaced as a hot commodity that can take your career to the next level and—more importantly—open the door to some really cool and impactful apps.

Ted Malaska leads an introduction, designed for Java and Scala developers, to basic Spark concepts such as DAGs, RDDs, transformations, actions, and executors. You’ll learn how your mindset must evolve beyond Java or Scala code that runs in a single JVM as you explore JVM locality, memory utilization, network and CPU usage, optimization of DAG pipelines, and serialization conservation.

Ted Malaska

Capital One

Ted Malaska is a director of enterprise architecture at Capital One. Previously, he was the director of engineering in the Global Insight Department at Blizzard; a principal solutions architect at Cloudera, helping clients find success with the Hadoop ecosystem; and a lead architect at the Financial Industry Regulatory Authority (FINRA). He has contributed code to Apache Flume, Apache Avro, Apache YARN, Apache HDFS, Apache Spark, Apache Sqoop, and many more. Ted is a coauthor of Hadoop Application Architectures, a frequent conference speaker, and a frequent blogger on data architectures.

Kai Voigt

Cloudera

Kai Voigt is a senior instructor at Cloudera, delivering Hadoop training classes for developers and administrators worldwide. Kai held the same role at MySQL, Sun, and Oracle. He has spoken at a number of O’Reilly conferences.