Presented By O’Reilly and Cloudera
Make Data Work
September 11, 2018: Training & Tutorials
September 12–13, 2018: Keynotes & Sessions
New York, NY

Getting ready for GDPR: Securing and governing hybrid, cloud, and on-premises big data deployments, step by step

Mark Donsky (Okera), Syed Rafice (Cloudera), Mubashir Kazia (Cloudera), Ifigeneia Derekli (Cloudera), Camila Hiskey (Cloudera)
9:00am–12:30pm Tuesday, 09/11/2018
Data engineering and architecture, Law, ethics, governance
Location: 1E 11
Level: Intermediate
Secondary topics: Data preparation, governance and privacy, Ethics and Privacy
Average rating: 4.50 (2 ratings)

Who is this presentation for?

  • Administrators, data stewards, and those responsible for compliance, infosec, or security

Prerequisite knowledge

  • A general understanding of big data, security, and governance concepts

What you'll learn

  • Learn best practices for securing and governing cloud, on-premises, and hybrid deployments, including wire encryption, data-at-rest encryption, governance best practices, and unified, secure data catalogs for self-service discovery
  • Understand important aspects of GDPR


Many Hadoop clusters lack even the most basic security and governance controls. This is due to several factors: some security features did not exist as recently as two years ago, and the complexity of Hadoop security has proved daunting to administrators. Nonetheless, with the emergence of regulations such as GDPR, organizations can no longer afford to overlook the criticality of security and governance.

Mark Donsky, Syed Rafice, Mubashir Kazia, Ifigeneia Derekli, and Camila Hiskey share hands-on best practices for ensuring a consistently secured and governed environment across multiple workloads, with special attention paid to GDPR. Mark, Syed, Mubashir, Ifigeneia, and Camila walk you through securing a Hadoop cluster. You’ll start with a cluster with no security and then add security features related to authentication, authorization, encryption of data at rest, encryption of data in transit, and complete data governance.

For each security feature, they cover the following topics:

  • Introduction: What the security feature is, what protection it provides, and best practices and recommendations
  • Planning: How to enable the feature in a phased manner with the fewest growing pains and least risk
  • Relevance: Why it’s important (demonstrated by live attacks against a cluster without the target security feature), and how it relates to GDPR
  • Implementation: An overview of how the implementation is performed, where the moving parts are, and potential pitfalls

Mark Donsky


Mark Donsky is a director of product management at Okera, a software company that provides discovery, access control, and governance at scale for today’s heterogeneous data environments, where he leads product management. Previously, Mark led data management and governance solutions at Cloudera, and he’s held product management roles at companies such as Wily Technology, where he managed the flagship application performance management solution, and Silver Spring Networks, where he managed big data analytics solutions that reduced greenhouse gas emissions by millions of dollars annually. He holds a BS with honors in computer science from Western University, Ontario, Canada.


Syed Rafice


Syed Rafice is a principal system engineer at Cloudera specializing in big data on Hadoop technologies and both platform and cybersecurity. He is responsible for designing, building, developing, and assuring a number of enterprise-level big data platforms using the Cloudera distribution. Syed has worked across multiple sectors including government, telecoms, media, utilities, financial services, and transport.


Mubashir Kazia


Mubashir Kazia is a principal solutions architect at Cloudera and an SME in Apache Hadoop security in Cloudera’s Professional Services practice, where he helps customers secure their Hadoop clusters and comply with internal security policies. He also helps new customers transition to the Hadoop platform and implement their first few use cases, and he trains and mentors peers in Hadoop and Hadoop security. Mubashir has worked with customers from all verticals, including banking, manufacturing, healthcare, telecom, retail, and gaming. Previously, he worked on developing solutions for leading investment banking firms.


Ifigeneia Derekli


Ifi Derekli is a senior solutions engineer at Cloudera, focusing on helping large enterprises solve big data problems using Hadoop technologies. Her subject-matter expertise is around security and governance, a crucial component of every successful production big data use case. Previously, Ifi was a presales technical consultant at Hewlett Packard Enterprise, where she provided technical expertise for Vertica and IDOL (currently part of Micro Focus). She holds a BS in electrical engineering and computer science from Yale University.


Camila Hiskey


Camila Hiskey is a senior systems engineer at Cloudera. A hands-on technologist, she architects enterprise data solutions primarily for large financial services and life sciences organizations. Camila helps educate IT and business teams on implementing Hadoop, open source software, and big data. Previously, she was an engineer and DBA at IBM, where she worked with operational data stores and analytical databases.



Thushara Wickram | BIG DATA ADMIN
09/11/2018 6:19am EDT

Is there a way to get the commands you use in the demo and the slides now?