Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Big data governance for the hybrid cloud: Best practices and how-to

Mark Donsky (Okera), Vikas Singh (Cloudera)
11:1511:55 Wednesday, 24 May 2017
Big data and the Cloud, Law, ethics, governance
Location: Capital Suite 14
Level: Intermediate
Average rating: ****.
(4.33, 9 ratings)

Who is this presentation for?

  • Data stewards, administrators, compliance officers, and curators

Prerequisite knowledge

  • A general familiarity with Hadoop technologies and common governance requirements, such as metadata tagging and lineage

What you'll learn

  • Learn a step-by-step approach to kickstart your big data governance initiatives, as well as how to apply uniform governance standards to on-premises, cloud-based, and hybrid deployments of Hadoop, how to protect your Hadoop deployments from security breaches, and how to safely roll out Hadoop while satisfying the needs of compliance groups, data stewards, data scientists, and BI users alike


Organizations must rethink their data management architectures as they embrace Hadoop. They must protect their most critical data assets while empowering the business to perform innovative analysis. At the same time, they must answer the following questions:

  • How can you make sure data doesn’t get lost in Hadoop?
  • How can users across the company find the data they care about and know that it’s trustworthy?
  • How can you protect yourself against a data breach?
  • How can you ingest ever-increasing volumes of diverse data while meeting these needs?
  • How can you ensure unified management across on-premises and cloud deployments of Hadoop?

Mark Donsky and Vikas Singh explain how some of the world’s most sophisticated Hadoop deployments are addressing these data challenges head-on while preserving Hadoop’s flexibility through an integrated data management and governance approach for Hadoop. Mark and Vikas share a big data governance capability maturity model that provides a step-by-step guide to kickstart your big data governance initiatives, based on best practices from leading big data deployments that span many different industries, both regulated and unregulated. Along the way, they also discuss how users can discover, trust, protect, and govern the data that matters most.

Photo of Mark Donsky

Mark Donsky


Mark Donsky is a director of product management at Okera, a software provider that provides discovery, access control, and governance at scale for today’s modern heterogenous data environments, where he leads product management. Previously, Mark led data management and governance solutions at Cloudera, and he’s held product management roles at companies such as Wily Technology, where he managed the flagship application performance management solution, and Silver Spring Networks, where he managed big data analytics solutions that reduced greenhouse gas emissions by millions of dollars annually. He holds a BS with honors in computer science from the Western University, Ontario, Canada.

Vikas Singh


Vikas Singh is a software engineer at Cloudera.