Presented By O'Reilly and Cloudera
Make Data Work
December 1–3, 2015 • Singapore
Sameer Farooqui
Client Services Engineer, Databricks

@blueplastic

Sameer Farooqui is a client services engineer at Databricks, where he works with customers on Apache Spark deployments. He works with the Hadoop ecosystem, Cassandra, Couchbase, and the general NoSQL domain. Prior to Databricks, he worked globally as a freelance big data consultant and trainer, teaching big data courses. Before that, Sameer was a systems architect at Hortonworks, an emerging data platforms consultant at Accenture R&D, and an enterprise consultant for Symantec/Veritas (specializing in VCS, VVR, and SF-HA).

Sessions

9:00am–5:00pm Tuesday, 12/01/2015
SOLD OUT
Spark & Beyond
Location: 328-329
Level: Intermediate
Sameer Farooqui (Databricks), Paco Nathan (derwen.ai), Reynold Xin (Databricks)
Average rating: 4.00 (20 ratings)
The real power and value proposition of Apache Spark lies in building a unified use case that combines ETL, batch analytics, real-time stream analysis, machine learning, graph processing, and visualizations. In class, we will explore various Wikipedia datasets while applying the ideal programming paradigm for each analysis. The class will consist of about 50% lecture and 50% hands-on labs and demos.