Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

A practitioner’s guide to Hadoop security for the hybrid cloud

Mark Donsky (Cloudera), Manish Ahluwalia (Nerdwallet), Andre Araujo (Cloudera), Syed Rafice (Cloudera)
1:30pm5:00pm Tuesday, September 26, 2017
Data Engineering & Architecture, Security
Location: 1A 18 Level: Intermediate
Secondary topics:  Cloud
Average rating: *****
(5.00, 1 rating)

Who is this presentation for?

  • Hadoop administrators and security ops engineers

Prerequisite knowledge

  • General knowledge of Hadoop and system administration procedures

Materials or downloads needed in advance

  • A WiFi-enabled laptop with the ability to run an SSH client

What you'll learn

  • Learn how to secure a Hadoop cluster for production operations


Many Hadoop clusters lack even the most basic security controls. This is due to several factors: some security features did not exist as recently as two years ago, and the complexity of Hadoop security has proved daunting to administrators.

Mark Donsky, André Araujo, Syed Rafice, and Manish Ahluwalia walk you through securing a Hadoop cluster. You’ll start with a cluster with no security and then add security features related to authentication, authorization, encryption of data at rest, encryption of data in transit, and complete data governance.

For each security feature, Mark, André, Syed, and Manish cover the following topics:

  • Introduction: What the security feature is, what protection it provides, and best practices and recommendations
  • Planning: How to enable the feature in a phased manner with the fewest growing pains and least risk
  • Relevance: Why it’s important (demonstrated by live attacks against a cluster without the target security feature)
  • Implementation: An overview of how the implementation is performed, where the moving parts are, and potential pitfalls
Photo of Mark Donsky

Mark Donsky


Mark Donsky leads data management and governance solutions at Cloudera. Previously, Mark held product management roles at companies such as Wily Technology, where he managed the flagship application performance management solution, and Silver Spring Networks, where he managed big data analytics solutions that reduced greenhouse gas emissions. He holds a BS with honors in computer science from the University of Western Ontario.

Photo of Manish Ahluwalia

Manish Ahluwalia


Manish Ahluwalia is a security engineering at Nerdwallet. Manish has held software architect roles at Tibco Loglogic and Thales Vormetric and was a security engineer at Cloudera, where he focused on the security of the Hadoop ecosystem. Manish has been working in big data since its infancy in various companies in Silicon Valley. He is most passionate about security.

Photo of Andre Araujo

Andre Araujo


André Araujo is a solutions architect with Cloudera. Previously, he was an Oracle database administrator. An experienced consultant with a deep understanding of the Hadoop stack and its components, André is skilled across the entire Hadoop ecosystem and specializes in building high-performance, secure, robust, and scalable architectures to fit customers’ needs. André is a methodical and keen troubleshooter who loves making things run faster.

Photo of Syed Rafice

Syed Rafice


Syed Rafice is a senior system engineer at Cloudera, where he specializes in big data on Hadoop technologies and is responsible for designing, building, developing, and assuring a number of enterprise-level big data platforms using the Cloudera distribution. Syed also focuses on both platform and cybersecurity. He has worked across multiple sectors, including government, telecoms, media, utilities, financial services, and transport.

Comments on this page are now closed.


Picture of Mohammed Ayub
Mohammed Ayub | DATA SCIENTIST
09/26/2017 5:27am EDT

Thanks Andre !

Picture of Andre Araujo
Andre Araujo | CLOUDERA
09/25/2017 8:06pm EDT

Hi, Mohammed,

Yes, we’ll post the slides after the tutorial and the recording will be available through the O’Reilly portal later on.


Picture of Mohammed Ayub
Mohammed Ayub | DATA SCIENTIST
09/25/2017 7:20pm EDT

Unfortunately, will miss this as it conflicts with another tutorial. Will the learning material be available for reference?