Behzad Bordbar demonstrates how to implement typical data science workflows using Apache Spark. You’ll learn how to wrangle and explore data using Spark SQL DataFrames and how to build, evaluate, and tune machine learning models using Spark MLlib. Demonstrations and exercises will be conducted in Python using Cloudera Data Science Workbench.
Behzad Bordbar is a mathematician, software engineer, and big data technical instructor at Cloudera, where he teaches courses on Hadoop, Hive, Impala, and Spark. Behzad has worked in academia for over 12 years and has been a visiting scientist at HP, BT, and IBM.
Get the Platinum pass or the Training pass to add this course to your package. .
Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?
Join the conversation here (requires login)
©2018, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org