Organizations have struggled to bring their data lakes into the mainstream because many of them lack adequate governance. This confines data lakes to serving as a sandbox for data science workloads. To effectively deliver a broad set of use cases, organizations have to make sure that their data assets are properly governed, secured, and trustworthy.
A new raft of regulations, such as the EU’s GDPR, is providing the catalyst needed to improve data hygiene. Data governance is no longer optional. It never was. In 2018, the sexiest job involves data governance. The question is how you can make this task impactful and easier to accomplish.
Sanjeev Mohan walks you through an end-to-end architectural blueprint for information governance and shares best practices for helping organizations understand, secure, and govern diverse types of data in enterprise data lakes.
Sanjeev Mohan leads big data research for technical professionals at Gartner, where he researches trends and technologies for relational and NoSQL databases, object stores, and cloud databases. His areas of expertise span the end-to-end data pipeline, including ingestion, persistence, integration, transformation, and advanced analytics. Sanjeev is a well-respected speaker on big data and data governance. His research includes machine learning and the IoT. He also serves on a panel of judges for many Hadoop distribution organizations, such as Cloudera and Hortonworks.
©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com