Efficient ML engineering: Tools and best practices
Who is this presentation for?
- Practicing data scientists and data engineers and CDOs with a mandate to build a team inside the organization
Business value comes from solving real needs by putting models into production. You need to be able to move ML models efficiently from research to deployment at enterprise scale. Part of the answer is about using the right workflow, and the other part is about choosing the right tools. The recent rise of the ML engineer is in large part due to evolving workflow best practices: just as DevOps folks have been working at the intersection of development and operations, today, ML engineers are working at the intersection of data science and software engineering—that is, ML ops. These folks must be integrated into the team with efficient tools and effective support. Manifold developed the Lean AI process and the open source Orbyter package for Docker-first data science to streamline the development process and help companies put successful models into production as smoothly and efficiently as possible. Even if you’ve never used Docker before, Orbyter makes containerization simple and elegant—which in turn makes your team’s work seamless and clean.
Sourav Dey and Jakov Kucan walk you through the six steps of the Lean AI process and explain how it helps your ML engineers work as an an integrated part of your development and production teams. You’ll get a hands-on example using real-world data, so you can get up and running with Docker and Orbyter and see firsthand how streamlined they can make your workflow. They cover creating an AI specification by understanding both your business and your data; using containerized data science for cleaner workflows (no experience needed); developing ML engineering as a core competency; being deliberate, disciplined, and coordinated with your process; and deploying seamlessly at production scale.
- A basic understanding of the software engineering process
- Familiarity with machine learning vocabulary (model, training, etc.)
Materials or downloads needed in advance
- A laptop
What you'll learn
- Discover how to get value from machine learning in a way that will affect the company's bottom line by building teams of data scientists and engineers that are well integrated into organizational teams delivering models into production
Sourav Dey is CTO at Manifold, an artificial intelligence engineering services firm with offices in Boston and Silicon Valley. Previously, Sourav led teams building data products across the technology stack, from smart thermostats and security cams at Google Nest to power grid forecasting at AutoGrid to wireless communication chips at Qualcomm. He holds patents for his work, has been published in several IEEE journals, and has won numerous awards. He holds PhD, MS, and BS degrees in electrical engineering and computer science from MIT.
Jakov Kucan is a senior architect at Manifold, an artificial intelligence engineering services firm with offices in Boston and Silicon Valley. Previously, Jakov was chief architect at Kyruus and director of product strategy at PTC. He’s a skilled architect and engineer, able to see through the details of implementations, keep track of the dependencies within a large design, and communicate the vision and ideas to both technical and nontechnical audiences. Jakov earned his PhD in computer science from MIT and his MA degree in mathematics and BSE degree in computer science and engineering from the University of Pennsylvania. He’s an author of several publications and patent applications.
Leave a Comment or Question
Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?
Join the conversation here (requires login)
For conference registration information and customer service
For more information on community discounts and trade opportunities with O’Reilly conferences
For information on exhibiting or sponsoring a conference
View a complete list of Strata Data Conference contacts