Presented By O'Reilly and Cloudera
Make Data Work
December 1–3, 2015 • Singapore

Spark & Beyond conference sessions

Tuesday, December 1

9:00am–5:00pm Tuesday, 12/01/2015
Location: 328-329 Level: Intermediate
Sameer Farooqui (Databricks), Paco Nathan (, Reynold Xin (Databricks)
Average rating: ****.
(4.00, 20 ratings)
The real power and value proposition of Apache Spark is in building a unified use case that combines ETL, batch analytics, real-time stream analysis, machine learning, graph processing and visualizations. In class we will explore various Wikipedia datasets while applying the ideal programming paradigm for each analysis. The class will comprise of about 50% lecture and 50% hands on labs + demos. Read more.