Presented By O'Reilly and Cloudera
Make Data Work
31 May–1 June 2016: Training
1 June–3 June 2016: Conference
London, UK

Securing Apache Spark on production Hadoop clusters

Kostas Sakellis (Cloudera)
17:25–18:05 Thursday, 2/06/2016
Spark & beyond
Location: Capital Suite 13 Level: Intermediate
Average rating: 4.00 (4 ratings)

Prerequisite knowledge

Attendees should have a general understanding of Spark.


As Spark is increasingly used for production workloads with stringent security requirements, fully locking down Spark applications has become critical. Kostas Sakellis explores the various facets of securing your Spark application and discusses open challenges and future work to improve overall security.

Topics include:

  • Authentication: how Spark utilizes Kerberos for application authentication with the same delegation token mechanism used by Apache Hadoop
  • Authorization: how Spark integrates with Hive and Sentry to provide fine-grained role-based access control to data
  • Encryption: how Spark protects sensitive data by encrypting it on the wire and on disk
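
As a rough illustration of how the authentication and encryption topics map onto configuration, the sketch below assembles a handful of Spark security properties and renders them as spark-submit arguments. It is not taken from the talk. The property names are existing Spark settings, but the principal, the keytab path, and the helper functions security_conf and to_submit_args are hypothetical. Note also that spark.io.encryption.enabled only appeared in Spark 2.1, after this session, and that Spark 3.x renamed the two spark.yarn.* Kerberos properties to spark.kerberos.*. Authorization through Hive and Sentry is configured outside Spark properties, so it is left out here.

```python
from typing import Dict, List


def security_conf(principal: str, keytab: str) -> Dict[str, str]:
    """Return an illustrative set of Spark security properties.

    The principal and keytab values are placeholders supplied by the caller.
    """
    return {
        # Authentication: Kerberos identity used on YARN so a long-running
        # application can log in again and obtain fresh delegation tokens.
        "spark.yarn.principal": principal,
        "spark.yarn.keytab": keytab,
        # Authentication between Spark's own processes via a shared secret.
        "spark.authenticate": "true",
        # Encryption on the wire: SASL encryption for authenticated connections.
        "spark.authenticate.enableSaslEncryption": "true",
        # Encryption on disk: local shuffle and spill files (Spark 2.1+).
        "spark.io.encryption.enabled": "true",
    }


def to_submit_args(conf: Dict[str, str]) -> List[str]:
    """Render the properties as repeated `--conf key=value` arguments."""
    args: List[str] = []
    for key in sorted(conf):
        args += ["--conf", f"{key}={conf[key]}"]
    return args


if __name__ == "__main__":
    # Hypothetical principal and keytab path, for illustration only.
    conf = security_conf("etl@EXAMPLE.COM", "/path/to/etl.keytab")
    print(" ".join(["spark-submit"] + to_submit_args(conf)))
```

Keeping the properties in one dictionary makes it easy to review which of the three areas (authentication, authorization, encryption) a given deployment actually covers before the job is submitted.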

Kostas Sakellis


Kostas Sakellis is the lead and engineering manager of the Apache Spark team at Cloudera. Kostas holds a bachelor’s degree in computer science from the University of Waterloo, Canada.