Presented By O'Reilly and Cloudera
Make Data Work
Sept 29–Oct 1, 2015 • New York, NY

media conference sessions

11:20am–12:00pm Thursday, 10/01/2015
Kurt Brown (Netflix)
Slides:   external link
The Netflix Data Platform is a constantly evolving, large scale infrastructure running in the (AWS) cloud. We are especially focused on performance and ease of use, with initiatives including Presto integration, Spark, and our big data portal and API. This talk will dive into the various technologies we use, the motivations behind our approach, and the business benefits we get.
2:55pm–3:35pm Wednesday, 09/30/2015
Juan Huerta (Dow Jones)
Slides:   1-PDF 
In this presentation I will describe the way in which Data Science is helping the Wall Street Journal produce better journalism strategies, personalize our subscribers’ experience, and optimize revenue and overall customer engagement.
1:15pm–1:55pm Wednesday, 09/30/2015
Claudia Perlich (Dstillery)
Slides:   1-PDF 
This talk takes a provocative stand: many metrics we cherish lose their value because the granularity of modern data collection enables us to identify and optimize toward hidden signals that used to be noise, and now come to the forefront. One such metric is the click-through rate in advertising, but the mechanism is ubiquitous and we should pay close attention to the mechanism at work.
4:35pm–5:15pm Thursday, 10/01/2015
Sridhar Alla (BlueWhale), Jan Neumann (Comcast)
Comcast uses Hadoop as the big data platform in several areas of its business. Their use cases have evolved in recent years and include personalization, clickthru analytics, modeling, and customer support initiatives, all adding up to billions of dollars in revenue.
1:15pm–1:55pm Thursday, 10/01/2015
Daniel Weeks (Netflix)
Slides:   1-PDF 
The Big Data Platform team at Netflix continues to push big data processing in the cloud with the addition of Spark to our platform. Recent enhancements to Spark allow us to effectively leverage it for processing against a 10+ petabyte warehouse backed by S3. We will share our experiences and performance of production jobs along with the pains and gains of deploying Spark at scale on YARN.
2:20pm–2:40pm Tuesday, 09/29/2015
Cecile Barbaroux (Schibsted Classified Media)
In this session, Cecile Barbaroux, head of data and insight, will explain how Schibsted Classified Media is reinventing itself to embrace the opportunities afforded by big data. The global company, once a group of decentralized marketplaces worldwide, is now working to become lean and data-driven.
4:35pm–5:15pm Wednesday, 09/30/2015
Adam Kelleher (Buzzfeed)
At BuzzFeed, a technology and media company, the question of “virality of content via sharing” dominates. Now, for the first time since the company was founded in 2006, data scientists can identify ways pieces of content spread across multiple social networks. In this paper, we present a close look into the way BuzzFeed defines and analyzes the virality of content.