Make Data Work
Oct 15–17, 2014 • New York, NY
Sean Owen

Director of Data Science, Cloudera

Website | @sean_r_owen

Sean is Director of Data Science for EMEA at Cloudera, helping customers build large-scale machine learning solutions on Hadoop. Previously, Sean founded Myrrix Ltd, which produced a real-time recommender and clustering product that evolved from Mahout. Myrrix is now part of Cloudera. Sean was the primary author of the recommender components in Apache Mahout and has been an active committer and PMC member for the project. He is a co-author of Mahout in Action.


5:05pm–5:45pm Thursday, 10/16/2014
Hadoop & Beyond
Location: 1 E20/1 E21
Sean Owen (Cloudera)
Average rating: 4.73 (11 ratings)
Apache Spark is a popular new paradigm for computation on Hadoop. It's particularly effective for iterative algorithms relevant to data science like clustering, which can be used to detect anomalies in data. Curious? Get a taste of Spark MLlib, Scala and k-means clustering in this walkthrough of anomaly detection as applied to network intrusion, using the KDD Cup '99 data set.
10:30am–11:00am Friday, 10/17/2014
Office Hour
Location: Table B
Sean Owen (Cloudera)
As the Director of Data Science at Cloudera, Sean is ready and willing to talk about large-scale machine learning on Hadoop, connecting R, SAS, and other software to Hadoop for analytics, and using Spark, MLlib, and Mahout.