Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Big data governance for the hybrid cloud: Best practices and how-to

Mark Donsky (Cloudera), Vikas Singh (Cloudera)
11:1511:55 Wednesday, 24 May 2017
Big data and the Cloud, Law, ethics, governance
Location: Capital Suite 14
Level: Intermediate
Average rating: ****.
(4.33, 9 ratings)

Who is this presentation for?

  • Data stewards, administrators, compliance officers, and curators

Prerequisite knowledge

  • A general familiarity with Hadoop technologies and common governance requirements, such as metadata tagging and lineage

What you'll learn

  • Learn a step-by-step approach to kickstart your big data governance initiatives, as well as how to apply uniform governance standards to on-premises, cloud-based, and hybrid deployments of Hadoop, how to protect your Hadoop deployments from security breaches, and how to safely roll out Hadoop while satisfying the needs of compliance groups, data stewards, data scientists, and BI users alike


Organizations must rethink their data management architectures as they embrace Hadoop. They must protect their most critical data assets while empowering the business to perform innovative analysis. At the same time, they must answer the following questions:

  • How can you make sure data doesn’t get lost in Hadoop?
  • How can users across the company find the data they care about and know that it’s trustworthy?
  • How can you protect yourself against a data breach?
  • How can you ingest ever-increasing volumes of diverse data while meeting these needs?
  • How can you ensure unified management across on-premises and cloud deployments of Hadoop?

Mark Donsky and Vikas Singh explain how some of the world’s most sophisticated Hadoop deployments are addressing these data challenges head-on while preserving Hadoop’s flexibility through an integrated data management and governance approach for Hadoop. Mark and Vikas share a big data governance capability maturity model that provides a step-by-step guide to kickstart your big data governance initiatives, based on best practices from leading big data deployments that span many different industries, both regulated and unregulated. Along the way, they also discuss how users can discover, trust, protect, and govern the data that matters most.

Photo of Mark Donsky

Mark Donsky


Mark Donsky leads data management and governance solutions at Cloudera. Previously, Mark held product management roles at companies such as Wily Technology, where he managed the flagship application performance management solution, and Silver Spring Networks, where he managed big data analytics solutions that reduced greenhouse gas emissions. He holds a BS with honors in computer science from the University of Western Ontario.

Vikas Singh


Vikas Singh is a software engineer at Cloudera.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)