Sep 23–26, 2019

Sameer Agarwal
Software Engineer, Facebook Inc.


Sameer Agarwal is an Apache Spark Committer and a Software Engineer at Facebook where he works as part of the Data Warehouse team on building distributed systems and databases that scale across clusters of tens of thousands of machines. He received his PhD in Databases from UC Berkeley AMPLab where he worked on BlinkDB, an approximate query engine for Spark.


1:15pm1:55pm Thursday, September 26, 2019
Location: 1A 06/07
Sameer Agarwal (Facebook Inc.)
Apache Spark is the largest compute engine at Facebook by CPU. This talk will cover the story of how we optimized, tuned and scaled Apache Spark at Facebook to run on clusters of tens of thousands of machines, processing hundreds of petabytes of data, and used by thousands of data scientists, engineers and product analysts every day. Read more.

Contact us

For conference registration information and customer service

For more information on community discounts and trade opportunities with O’Reilly conferences

For information on exhibiting or sponsoring a conference

Contact list

View a complete list of Strata Data Conference contacts