Building out a data governance strategy on Hadoop has many unique and new challenges. Massively growing organizational models, the desire for more ETL offloading, and new Agile methodology practices are all driving the need to define what next-generation data governance means to an enterprise. Clara Fletcher explores what next-generation data governance will look like and what the trends will be in this space. Beyond the strategy overview, Clara dives into key technical challenges when building out a data governance-driven Hadoop platform. Clara concludes with a demonstration that shows how a data governance-driven platform operates in the enterprise, from Spark Streaming ingestion to JMS metadata tagging to master data management integration.
Clara Fletcher is a senior manager and technical architect at Accenture. Clara comes from a broad background that includes econometric forecasting, complex event processing, and infrastructure design and enterprise data provisioning. She is actively involved with emerging big data technologies, industry interest groups, and volunteer education programs. She has won the National Service Trust award, served as a Hackbright mentor, and holds a patent in digital document verification technology. Clara also instructs the Accenture hands-on big data course and is the lead of the online NoSQL course development.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.