With the rapid adoption of Apache Spark—one of the most active Apache projects today—and the need for programs to span many machines to solve the world’s greatest problems, distributed computing has resurfaced as a hot commodity that can take your career to the next level and—more importantly—open the door to some really cool and impactful apps.
Ted Malaska leads an introduction to basic Spark concepts such as DAGs, RDDs, transformations, actions, and executors, designed for Java and Scala developers. You’ll learn how your mindset must evolve beyond Java or Scala code that runs in a single JVM as you explore JVM locality, memory utilization, network/CPU usage, optimization of DAGs pipelines, and serialization conservation.
Ted Malaska is a director of enterprise architecture at Capital One. Previously, he was the director of engineering in the Global Insight Department at Blizzard; principal solutions architect at Cloudera, helping clients find success with the Hadoop ecosystem; and a lead architect at the Financial Industry Regulatory Authority (FINRA). He has contributed code to Apache Flume, Apache Avro, Apache Yarn, Apache HDFS, Apache Spark, Apache Sqoop, and many more. Ted is a coauthor of Hadoop Application Architectures, a frequent speaker at many conferences, and a frequent blogger on data architectures.
Kai Voigt is a senior instructor for Hadoop classes at Cloudera, delivering training classes for developers and administrators worldwide. Kai held the same role at MySQL, Sun, and Oracle. He has spoken at a number of O’Reilly conferences.
©2016, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.