Garbage in, garbage out: this truism matters even more for big data now that companies have moved away from traditional schema-based approaches toward more flexible and dynamic file system approaches. Central to this journey is the need to industrialize ingestion and distillation, the core processes of big data management. Steve Jones explains how to add the governance, schema evolution, and industrialization required to deliver true enterprise-grade big data solutions.
This session is sponsored by Capgemini.
Steve Jones is Capgemini’s group vice president for big data. Steve focuses on delivering large-scale big data solutions that answer specific business challenges. He is the author of Enterprise SOA Adoption Strategies and the creator of the Business Data Lake reference architecture, the first unified approach to big and fast data analytics. Steve was also one of the first to integrate Google, Salesforce, and Amazon solutions into traditional enterprises.
©2016, O'Reilly Media, Inc. • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners.
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with, and does not endorse or review, the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.