TUTORIAL PREREQUISITES
Welcome to Machine Learning 101! We will be learning the basics of machine learning by building real applications: recommender and image analysis with deep learning.
The tutorial landing page (https://dato.com/events/training/2015-strata-nyc.html) contains the agenda for the day as well as relevant material. Please follow the instructions for setting up your laptop PRIOR to arriving onsite.
If you run into any problems during installation, please email training@dato.com.
TUTORIAL DESCRIPTION
This hands-on tutorial provides a quick start to building intelligent business applications using machine learning. Learn about machine learning basics, feature engineering, recommender systems, and deep learning. We will build and deploy large-scale machine learning applications with Dato’s Machine Learning platform: GraphLab Create, Dato Distributed, and Dato Predictive Services. The program will center around building two applications: a content-based recommender that tells you which talks you might be interested in at Strata, and an image search application built using deep learning.
We will walk you through all the steps of prototyping and production: data cleaning, feature engineering, model building and evaluation, and deployment.
Please check back here prior to the tutorial date for installation instructions.
Topics:
Alice Zheng leads the machine learning optimization team on Amazon’s advertising platform. She specializes in research and development of machine learning methods, tools, and applications. Outside of work, she is writing a book, Mastering Feature Engineering. Previously, Alice worked at GraphLab/Dato/Turi, where she led the machine learning toolkits team and spearheaded user outreach. Prior to joining GraphLab, she was a researcher in the Machine Learning group at Microsoft Research, Redmond. Alice holds PhD and BA degrees in computer science and a BA in mathematics, all from UC Berkeley.
Chris DuBois is a data scientist focused on building tools for other data scientists. At Dato, Chris has helped design and implement tools for creating recommendation systems and for large-scale text analysis. His current work makes it simpler to train models that generalize well. After studying applied mathematics at Pomona College, he earned a PhD in statistics from the University of California, Irvine, where he researched latent variable models for social-network data occurring over time.
Piotr Teterwak works on the toolkit development team at Dato. He received a BA in computer science from Dartmouth College, where he conducted work exploring the learning of convolutional deep neural nets with applications in computer vision.
Krishna Sridhar is a data scientist at Dato. He holds a PhD in computer science from the University of Wisconsin-Madison, where he worked on high-performance software for large-scale problems in mathematical optimization and data analysis. Krishna’s work has been used in applications such as healthcare, industrial production planning, and machine learning.
Comments on this page are now closed.
For exhibition and sponsorship opportunities, email stratahadoop@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata + Hadoop World contacts
©2015, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • conf-webmaster@oreilly.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.
Comments
Hi Celso,
The material/instructions can be found here: https://dato.com/events/training/2015-strata-nyc.html
This page will be updated soon with the link above.
Let us know if you run into any problems with the setup!
Hi, when materials and/or downloads will be available?
Thanks.