Make Data Work
Oct 15–17, 2014 • New York, NY
Paco Nathan

Evil Mad Scientist

Website | @pacoid

O’Reilly author (Enterprise Data Workflows with Cascading and the new “Just Enough Math”) and a “player/coach” who has led innovative data teams building large-scale apps. OSS evangelist for Apache Spark (Databricks), workshop instructor (Global Data Geeks), and advisor to Zettacap, The Data Guild, and Amplify Partners. Expert in machine learning, cluster computing, and enterprise use cases for Big Data. Interests: Spark, Mesos, PMML, Open Data, Cascalog, Scalding, Python for analytics, NLP.


9:00am–5:00pm Wednesday, 10/15/2014
Hadoop & Beyond
Location: Hall A 23/24
Paco Nathan, michael dddd (Databricks), Tathagata Das (Databricks), Matei Zaharia (Databricks), Reynold Xin (Databricks), Ameet Talwalkar (Carnegie Mellon University | Determined AI), Holden Karau (Independent), Joseph Bradley (Databricks), Sameer Farooqui (Databricks), Patrick Wendell (Databricks)
Average rating: 3.75 (20 ratings)
Spark Camp, organized by the creators of the Apache Spark project at Databricks, will be a day-long, hands-on introduction to the Spark platform, including Spark Core, the Spark Shell, Spark Streaming, Spark SQL, MLlib, and more.
1:30pm–5:00pm Wednesday, 10/15/2014
Business & Industry
Location: 1 E6/1 E7
Paco Nathan, Allen Day (MapR Technologies)
Average rating: 3.46 (13 ratings)
Advanced math for business people: “just enough math” to take advantage of new classes of open source frameworks. Many people take college math up to calculus but never learn how to approach sparse matrices, complex graphs, or supply chain optimizations. This tutorial ties these pieces together into a conceptual whole, with use cases and simple Python code, as a new approach to computational thinking.
11:00am–11:40am Thursday, 10/16/2014
Office Hour
Location: Table A
Matei Zaharia (Databricks), michael dddd (Databricks), Paco Nathan, Tathagata Das (Databricks)
This is your chance to corner a gaggle of Apache Spark developers about the latest updates, use cases and deployment, and ways you can get started with Spark.
2:35pm–3:15pm Thursday, 10/16/2014
Office Hour
Location: Table A
Paco Nathan, Allen Day (MapR Technologies)
Paco and Allen have a great perspective on DNA sequencing, biomedical data, data best practices, and future opportunities. Come by and ask them how to put business context into mathematical techniques, especially for the parts "beyond calculus" that enable high-ROI solutions today.