Guillaume Salou shares OVH’s approach to continuous deployment of machine learning models, which involved building a full stack of automated machine learning. Automated machine learning allows the company to rebuild models efficiently and keep models up to date with fresh data brought by its data convergence tool.
In most offices, DevOps and data scientists are on separate teams. OVH has merged the teams so that data scientists can access DevOps improvements like continuous delivery and DevOps can access data scientists’ knowledge. The continuous delivery of models is not as easy as building and deploying an application. First, raw data must be transformed into features, which are then preprocessed. Only then can you train and build a model. To achieve the best results, you must test different types of models associated with variables called hyperparameters. OVH monitors model performance and chooses the best one.
Guillaume discusses OVH’s first shared project, public cloud instance fraud detection. For this project, it was necessary to continually and automatically keep the model up to date in production and fed by fresh data. Guillaume outlines the architecture for the project, built on open source software like Jupyter, CDS, Warp 10, openscoring, PMML, and scikit-learn. This approach is pragmatically led by a metrics data platform. For now, this is basically an autoML solution, rebuilt daily by batch. The solution is efficient but insufficient. Guillaume explains how OVH is laying the next steps of a streamed fully automated machine learning platform that will allow data scientists to work on Stage-Gate innovation processes and efficiently go to production.
Guillaume Salou is the machine learning services team leader at OVH, where he is focusing on extracting high value from specific data science applications in order to make it available to all. Previously, he worked on data lakes.
For exhibition and sponsorship opportunities, email strataconf@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata Data Conference contacts
©2018, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com