San FranciscoLondon New York

Presented By
O’Reilly + Cloudera

Make Data Work

March 25-28, 2019
San Francisco, CA

Please log in

Add to Your Schedule

The hitchhiker's guide to deep learning-based recommenders in production

Abhishek Kumar (Publicis Sapient), pramod singh (Walmart Labs )

1:30pm–5:00pm Tuesday, March 26, 2019

Data Science, Machine Learning & AI
Location: 2002

Secondary topics: Deep Learning, Media, Marketing, Advertising, Model lifecycle management

Average rating:

(4.17, 6 ratings)

Who is this presentation for?

Data scientists, ML engineers, executives in data science, and architects

Level

Beginner

Prerequisite knowledge

Familiarity with TensorFlow and recommender systems

What you'll learn

Explore deep learning use cases in the enterprise, such as content personalization and recommenders, and the corresponding reference architectures and ML/DL pipelines
Learn how to use TensorFlow Serving, MLFlow, Flask, Gunicorn, NGINX, and Locust
Understand how to implement CI/CD pipelines for ML and DL

Description

Deep learning has become the de facto AI technique employed in many enterprises to build recommender systems. Deep learning offers the ability to perform auto-feature engineering and handle feature interactions. However, though it’s well understood and most people have started using TensorFlow or PyTorch to build deep learning models, it takes a lot of time and effort and exploration to put deep learning recommender models into production.

Abhishek Kumar and Pramod Singh walk you through deep learning-based recommender and personalization systems they’ve built for clients. Join in to learn how to use TensorFlow Serving and MLflow for end-to-end productionalization, including model serving, Dockerization, reproducibility, and experimentation, and Kubernetes for deployment and orchestration of ML-based microarchitectures. You’ll explore applications of deep learning, such as using LSTM models for forecasting and sequence embedding (and consequently, for predicting the next best action of customers in a personalization context) and using CNNs for image classification. You’ll also see how to do A/B testing to determine the improvement in clickthrough rate and lift for production deployments.

Topics include:

Reference architectures for microservices and deep learning-based recommender and content personalization systems
How to leverage TensorFlow Serving for model serving and augment it with model management tools
How to use MLflow for end-to-end productionalization, including model serving, Dockerization, reproducibility, and experimentation
Creating ML and DL APIs using NGINX/Gunicorn for load balancing and deploying ML-based microservices architecture using Docker and Kubernetes
Load testing tools such as Locust and how they help in benchmarking ML and DL API performance in a distributed and scalable fashion
Creating real-time pipelines using Kafka and Spark Structured Streaming
Creating batch pipelines using Apache AirFlow
Creating CI/CD pipelines for ML and DL
A comparison of this productionization mechanism with Airbnb’s BigHead

Abhishek Kumar

Publicis Sapient

Abhishek Kumar is a senior manager of data science in Publicis Sapient’s India office, where he looks after scaling up the data science practice by applying machine learning and deep learning techniques to domains such as retail, ecommerce, marketing, and operations. Abhishek is an experienced data science professional and technical team lead specializing in building and managing data products from conceptualization to the deployment phase and interested in solving challenging machine learning problems. Previously, he worked in the R&D center for the largest power-generation company in India on various machine learning projects involving predictive modeling, forecasting, optimization, and anomaly detection and led the center’s data science team in the development and deployment of data science-related projects in several thermal and solar power plant sites. Abhishek is a technical writer and blogger as well as a Pluralsight author and has created several data science courses. He’s also a regular speaker at various national and international conferences and universities. Abhishek holds a master’s degree in information and data science from the University of California, Berkeley. Abhishek has spoken at past O’Reilly conferences, including Strata 2019, Strata 2018, and AI 2019.

Website

pramod singh

Walmart Labs

Pramod Singh is a senior machine learning engineer at Walmart Labs. He has extensive hands-on experience in machine learning, deep learning, AI, data engineering, designing algorithms, and application development. He has spent more than 10 years working on multiple data projects at different organizations. He’s the author of three books Machine Learning with PySpark, Learn PySpark, and Learn TensorFlow 2.0. He’s also a regular speaker at major conferences such as the O’Reilly Strata Data and AI Conferences. Pramod holds a BTech in electrical engineering from BATU, and an MBA from Symbiosis University. He’s also done data science certification from IIM–Calcutta. He lives in Bangalore with his wife and three-year-old son. In his spare time, he enjoys playing guitar, coding, reading, and watching football.

Website

Presented by

Strategic Sponsors

Zettabyte Sponsor

Contributing Sponsors

Exabyte Sponsors

Impact Sponsors

Supporting Sponsor

Sponsorship Opportunities

For exhibition and sponsorship opportunities, email strataconf@oreilly.com

Partner Opportunities

For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com

Contact Us

View a complete list of Strata Data Conference contacts

©2019, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com