Trends driving demand for automated machine learning (AML) include the growing availability of big data through Hadoop architecture and the shortage of experienced data scientists. Successful big data projects require careful consideration of project definition, success criteria, organizational design, and implementation, and executives contribute vitally to this process. Jeremy Achin teaches executives how to identify opportunities to optimize their business using machine learning. This means radically reducing time to value the total cycle time from data to predictions and broadening the pool of people who can contribute to machine-learning projects without sacrificing quality. Jeremy also introduces DataRobot, AML software that supports and reflects these best practices, and explains how DataRobot interfaces with HDFS, YARN, Apache Spark, and other key components in a Hadoop cluster.
This session is sponsored by DataRobot.
Jeremy Achin is a data scientist turned entrepreneur with a vision. In his current role as CEO of DataRobot, Jeremy sets the direction of the company, products, and culture. He’s considered a data science thought leader and spends significant time visiting the world’s largest organizations to speak about practicing data science at scale. Previously, Jeremy was director of research and modeling at Travelers Insurance, where he built predictive models for pricing, retention, conversion, elasticity, lifetime value, customer behavior, claims, and much more. A true data science enthusiast, Jeremy even spends his spare time building predictive models, usually on the data science competition platform Kaggle.com. Jeremy studied math, physics, computer science, and statistics at the University of Massachusetts, Lowell.
Tom de Godoy is co-founder and CTO of DataRobot, and has over 10 years of Data Science and engineering experience. Previously, he was Senior Director of Research and Modeling at Travelers Insurance where he managed a team of Data Scientists on applications in pricing, claims and customer behavior for various insurance products. Tom has been ranked as high as 20th in the world on the data science competition platform Kaggle.com, which boasts more than 500,000 registered Data Scientists.
Comments on this page are now closed.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.