Most people are surprised to know that Spark works with Java. Maybe they saw the initial Java code that used anonymous classes and dismissed it as an ungainly mess. They were right—plain Java and Spark are ugly together. Then Java 8’s lambdas came along. Now, instead of an ungainly mess, we get the tight syntax of lambda expressions offering code that is readable and testable. Best of all, it uses Java.
Jesse Anderson demonstrates how to create Java lambdas and integrate them with Spark to process data. Then Jesse explains how to find Java resources about Spark and outlines the pros and cons for a Java developer to learn Scala.
Learning two big concepts at once is often a nonstarter. People learning Spark are often learning Scala at the same time. If we can remove one big concept (learning Scala), people will be more successful at learning Spark.
Jesse Anderson is a data engineer, creative engineer, and managing director of the Big Data Institute. Jesse trains employees on big data—including cutting-edge technology like Apache Kafka, Apache Hadoop, and Apache Spark. He’s taught thousands of students at companies ranging from startups to Fortune 100 companies the skills to become data engineers. He’s widely regarded as an expert in the field and recognized for his novel teaching practices. Jesse is published by O’Reilly and Pragmatic Programmers and has been covered in such prestigious media outlets as the Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired. You can learn more about Jesse at Jesse-Anderson.com.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.