Presented By O'Reilly and Cloudera
Make Data Work
Feb 17–20, 2015 • San Jose, CA
Josh Baer

Josh Baer
Machine Learning Platform Product Lead, Spotify


Josh spent six years as a software engineer building infrastructure components at AT&T before discovering the world of ‘Big Data’ in a class at NYU by O’Reilly author Foster Provost. He ‘joined the band’ at Spotify in early 2013 and has worked on a small team focusing on stabilizing and enhancing the Hadoop infrastructure, performing multiple migrations, upgrades and growing the cluster from 190 nodes to over 900. Today, Josh lives in Stockholm, Sweden and leads the platform vision as the Hadoop Product Owner.

Josh holds a BS in Computer Science/Philosophy from the University of Pittsburgh and a MS in Computer Science from NYU.


10:40am–11:20am Thursday, 02/19/2015
Hadoop in Action
Location: 210 A/E
Josh Baer (Spotify), Rafał Wojdyła (Spotify)
Average rating: ****.
(4.25, 4 ratings)
There's many confusing and painful things about setting up and operating a 900 node Hadoop cluster used as the centerpiece in many of Spotify's Big Data initiatives, we'll go over a few interesting stories and frustrations which have influenced the direction of our architectural choices and the lessons we've learned from them. Read more.