Presented By O’Reilly and Cloudera
Make Data Work
September 11, 2018: Training & Tutorials
September 12–13, 2018: Keynotes & Sessions
New York, NY

Building turnkey recommendations for 5% of internet video

Nir Yungster (JW Player), Kamil Sindi (JW Player)
4:20pm–5:00pm Thursday, 09/13/2018
Secondary topics:  Deep Learning, Media, Marketing, Advertising, Recommendation Systems

Who is this presentation for?

  • Data engineers and data scientists

Prerequisite knowledge

  • Familiarity with data science models, such as convolutional neural networks, and data engineering tools, such as Docker, Elasticsearch, and TensorFlow

What you'll learn

  • Learn how to deliver recommendations as a service architecturally and build a process to systematically improve recommendations

Description

JW Player—the world’s largest network-independent video platform, representing 5% of global internet video—provides on-demand recommendations as a service to thousands of media publishers, driving higher engagement and retention among their viewers. For the thousands of publishers using this service, sites such as Business Insider, Refinery29, Hearst, and USA Today, this translates directly to increased advertising dollars and is thus a major focus for algorithmic improvement on the part of JW Player’s data science team.

Nir Yungster and Kamil Sindi explain how the company is systematically improving model performance while navigating the many engineering challenges and unique needs of the diverse publishers it serves.

Topics include:

  • Building an MVP service that is scalable and tuned to each publisher’s content and use case, including both associative-based models built on a Kafka-Storm-Redis pipeline and a content-based approach that relies on Elasticsearch for delivery, with a high-availability service handling 20K requests a second.
  • Building an offline-online iteration process to systematically improve performance: Some of the more interesting discussion areas here include how JW Player defines model performance metrics for thousands of publishers with differing content and viewer contexts and A/B testing that shows that changes in algorithms can lead to dramatically different outcomes depending on the publisher. Some of those changes are inherent to the content type (e.g., visitors to a news site like USA Today will naturally be more sensitive to the age of content than visitors to a general information site like Investopedia, where the relevance of content is stable over time). Other differences arise from implementation differences (e.g., the success of content-based recommendations can depend on the diversity of a publisher’s library or the quality of titles and descriptions from its editors).
  • Using a deep learning framework to build personalized recommendations: The reasons for making this change include helping unify content and association-based approaches (instead of separate models, they are features/inputs); providing a flexible platform for adding new features (e.g., NLP-based video categorization and viewer preferences and NLP/audio/visual-based psychographic features for videos, such as valence, arousal, and dominance—and viewer preferences for each of these); and using A/B testing to help refine offline training metrics for deep learning models.
Photo of Nir Yungster

Nir Yungster

JW Player

Nir Yungster leads the data science team at JW Player, focusing on providing recommendations as a service to thousands of online video publishers. Nir studied aerospace engineering at Princeton University and holds a master’s degree in applied mathematics from Northwestern University.

Photo of Kamil Sindi

Kamil Sindi

JW Player

Kamil Sindi is a principal engineer at JW Player, where he works on productionizing machine learning algorithms and scaling distributed systems. He holds a bachelor’s degree in mathematics with computer science from the Massachusetts Institute of Technology.