Predictive applications unlock tremendous value for businesses through intelligent use of data. From personalized recommenders, targeted advertising, to customer churn prediction and influence analysis, predictive applications can enhance the customer experience and increase business revenue. In order to make decisions such as which ad to show to which user, what items to recommend, and which customers are most likely to churn, these smart apps rely on sophisticated analysis machine learning algorithms and models. Machine learning techniques are powerful, but building and deploying such models require a lot of care and expertise.
With a new generation of tools, it is possible to build such applications with a minimal amount of algorithm expertise. However, one must know how to evaluate, test, and track the performance of these machine learning models over time. This talk engages potential application builders on topics such as common evaluation metrics, A/B testing set up, tracking model performance, tracking usage via real-time feedback, and updating models. The talk will go into details on how to read the tea leaves of model training and prediction output, and understand “distribution drift.” We will also show practical demonstrations of these ideas using GraphLab Create and other open-source tools.
Alice is the director of data science at GraphLab, a Seattle-based startup that offers powerful large-scale machine learning and graph analytics tools. She loves playing with data and enabling others to play with data. She is a tool builder and an expert in machine learning algorithms. Her research spans software diagnosis, computer network security, and social network analysis. Prior to joining GraphLab, she was a researcher at Microsoft Research, Redmond. She holds Ph.D. and B.A. degrees in computer science, and a B.A. in mathematics, all from U.C. Berkeley.
©2015, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.