Data scientists build information platforms to provide deep insight and answer previously unimaginable questions. Spark and Hadoop are transforming how data scientists work by allowing interactive and iterative data analysis at scale.
Maojin Jiang demonstrates how Spark and Hadoop enable data scientists to help companies reduce costs, increase profits, improve products, retain customers, and identify new opportunities. Through in-class simulations and exercises, Maojin walks you through applying data science methods to real-world challenges in different industries, offering preparation for data scientist roles in the field.
Maojin Jiang is an instructor at Cloudera. Previously, Maojin worked as a big data engineer, a software engineer, a DevOps developer, a system administrator, and a researcher with interests in topic-sentiment analysis, information retrieval, web mining, political text analysis, machine learning, and natural language processing. In early 2012, Maojin introduced Cloudera Hadoop training into mainland China. Since then, he has dedicated himself to driving widespread adoption of Hadoop-based big data technologies by helping hundreds of engineers, architects, IT managers, executives, and university students and their teachers gain knowledge of Hadoop-based big data technologies, including Cloudera’s industry-leading and world-recognized best practices and solutions.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.