Website | @pacoid
O’Reilly author (Enterprise Data Workflows with Cascading and the new “Just Enough Math”) and a “player/coach” who’s led innovative Data teams building large-scale apps. OSS evangelist for Apache Spark (Databricks), workshop instructor (Global Data Geeks), advisor to Zettacap, The Data Guild, Amplify Partners. Expert in machine learning, cluster computing, and Enterprise use cases for Big Data. Interests: Spark, Mesos, PMML, Open Data, Cascalog, Scalding, Python for analytics, NLP.
9:00am–5:00pm Wednesday, 10/15/2014
Hadoop & Beyond
Location: Hall A 23/24
Spark Camp, organized by the creators of the Apache Spark project at Databricks, will be a day long hands-on introduction to the Spark platform including Spark Core, the Spark Shell, Spark Streaming, Spark SQL, MLlib, and more.
1:30pm–5:00pm Wednesday, 10/15/2014
Business & Industry
Location: 1 E6/1 E7
Advanced math for business people: “just enough math” to take advantage of new classes of open source frameworks. Many take college math up to calculus, but never learn how to approach sparse matrices, complex graphs, or supply chain optimizations. This tutorial ties these pieces together into a conceptual whole, with use cases and simple Python code, as a new approach to computational thinking.
2:35pm–3:15pm Thursday, 10/16/2014
Location: Table A
Paco and Allen have a great perspective on DNA sequencing, biomedical data, data best practices, and future opportunities. Come by and ask them how to put business context into mathematical techniques, especially for the parts "beyond calculus" that enable high-ROI solutions today.