Presented By O'Reilly and Cloudera
Make Data Work
Feb 17–20, 2015 • San Jose, CA
Patrick Wendell

Patrick Wendell
Cofounder of Databricks, Databricks

Website

Patrick Wendell is an engineer at Databricks as well as a Spark
Committer and PMC member. In the Spark project, Patrick has acted as
release manager for several Spark releases, including Spark 1.0 and 1.1.
Patrick also maintains several subsystems of Spark’s core engine.
Before helping start Databricks, Patrick obtained an M.S. in Computer
Science at UC Berkeley. His research focused on low latency scheduling
for large scale analytics workloads. He holds a B.S.E in Computer
Science from Princeton University.

Sessions

11:30am–12:10pm Friday, 02/20/2015
Spark in Action
Location: 230 C
Patrick Wendell (Databricks)
Average rating: *****
(5.00, 3 ratings)
Apache Spark is a popular engine for large scale analytics. This talk will give insights into tuning and debugging a production Spark deployment. It will start with details about Spark internals and an overview of the runtime behavior of a Spark application. I'll explain how to diagnose performance bottlenecks and get the best performance out of Spark jobs. Read more.