Presented By O’Reilly and Cloudera

San Jose • London • New York

Make Data Work

March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

Machine learning versus machine learning in production

Manu Mukerji (8x8)

11:50am–12:30pm Wednesday, March 7, 2018

Big data and data science in the cloud, Data engineering and architecture, Media, entertainment, and advertising, Streaming systems and real-time applications
Location: LL21 E/F

Average rating:

(4.22, 9 ratings)

View slides

Who is this presentation for?

Those working on ML or AI in production

Prerequisite knowledge

A basic understanding of machine learning concepts and Hadoop

What you'll learn

Learn how Acme Corporation uses machine learning for universal catalogs

Description

Acme Corporation, a global leader in commerce marketing, classifies 4.5B products a day into ~4,500 categories using Google Taxonomy. At 600 TB of data per day, Acme Corporation has the largest Hadoop cluster in Europe. Manu Mukerji walks you through Acme Corporation’s machine learning example for universal catalogs, explaining how the training and test sets are generated and annotated; how they were created when there is no public training data available; how the model is pushed to production, automatically evaluated, and used; how Acme Corporation built a Hadoop/Spark pipeline using different types of models predicting various values; production issues that arise when applying ML at scale in production; and lessons learned along the way.

Manu Mukerji

8x8

Manu Mukerji is senior director of data, machine learning, and analytics at 8×8. Manu’s background lies in cloud computing and big data, working on systems handling billions of transactions per day in real time. He enjoys building and architecting scalable, highly available data solutions and has extensive experience working in online advertising and social media.

Comments on this page are now closed.

Comments

Manu Mukerji | SENIOR DIRECTOR, DATA, MACHINE LEARNING, AND ANALYTICS

03/12/2018 8:32am PDT

Thanks for attending the session: Here is a link to the slides: https://www.slideshare.net/ManuMukerji/machine-learning-in-production-manu-mukerji-strata-ca-march-2018

Let me know if you guys have questions or feedback

Dejan Miljkovic | PRINCIPAL ENGINEER

03/12/2018 5:16am PDT

Cool demo

Presented by

Elite Sponsors

Strategic Sponsors

Zettabyte Sponsor

Contributing Sponsors

Exabyte Sponsors

Impact Sponsors

Sponsorship Opportunities

For exhibition and sponsorship opportunities, email strataconf@oreilly.com

Partner Opportunities

For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com

Contact Us

View a complete list of Strata Data Conference contacts

©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com