A data catalog provides context to help data analysts, data scientists, and other data consumers (including those with little technical background) find a relevant dataset, determine if it can be trusted, understand what it means, and utilize it to make better products and better decisions. Aaron Kalb explores how enterprises build interfaces that make sourcing data as easy as shopping on Amazon.
Aaron gives an overview of data catalogs and explains how they relate to concepts like data dictionaries or data inventories. He also covers some of the fastest and most effective ways to build a data catalog, discussing the roles crowds, experts, and machines play.
Aaron Kalb has spent his career crafting and empowering delightful human-computer interactions, especially through natural language interfaces. Aaron currently leads the design team and guides the product vision at Alation, after leaving Stanford with a BS and an MS in symbolic systems and working at Apple on iOS and Siri (doing engineering, research, and design in the Advanced Development Group). In his spare time, he enjoys backpacking, board games, and Thai food.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.