Machine learning’s popularity has grown tremendously in recent years. The drive to integrate machine learning into every solution has never been more pronounced, but the path from investigation to model development to implementation in production can be difficult. Amazon SageMaker AWS’s new machine learning platform seeks to make this process easier.
Machine learning starts with data, and Spark is one of the most popular and flexible solutions for handling large datasets for ETL, ad hoc analysis, and advanced machine learning. However, using Spark for production machine learning use cases can create inconsistencies in algorithm scale, conflicts over cluster resources, and prediction latencies. By offloading training to Amazon SageMaker’s highly scalable algorithms and distributed, managed training environment and deploying with SageMaker’s real-time production endpoints, implementing machine learning in production is easier and more reliable.
David Arpin walks you through building a machine learning application, from data manipulation to algorithm training to deployment to a real-time prediction endpoint, using Spark and Amazon SageMaker.
Comments on this page are now closed.
For exhibition and sponsorship opportunities, email strataconf@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata Data Conference contacts
©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com
Comments
Hi – are the sample notebooks or slides going to be shared for this presentation? Thanks