Sep 23–26, 2019
Please log in

Executive Briefing: Data catalogs—Concepts, capabilities, and key platforms

Andrew Brust (Blue Badge Insights | ZDNet)
4:35pm5:15pm Wednesday, September 25, 2019
Location: 1E 10/11
Average rating: ****.
(4.83, 6 ratings)

Who is this presentation for?

  • CIOs, CTOs, chief data officers, data stewards, database administrators (DBAs), data analysts, and BI architects

Level

Intermediate

Description

Data catalogs are not new; in fact they’ve been around for decades. But in the age of data lakes, self-service analytics, and data protection regulation, they’ve taken on new capabilities and renewed importance. There are a number of products in the market now, and they differ greatly, with a number of subcategories in the space.

Some data catalogs focus on data discoverability, others on governance and security. Some are oriented toward relational databases and data warehouses, while others are tied to more modern data sources. Many of the products use AI and machine learning to help automate the catalog build out, but almost all of them do so differently. And, while there are several startups in the field, public cloud providers have entries here, as do incumbent software megavendors.

Andrew Brust guides you through the importance of data catalogs, covers the range of data catalog capabilities, and explores the key players and their platforms. Andrew also provides an analysis of where the space is headed and what it will need to provide to address customer needs and pain points. You’ll get up to speed on the subject quickly, and no prior data catalog knowledge is required.

Prerequisite knowledge

  • A basic understanding of databases, data files, and data types
  • General knowledge of data warehouses and data lakes (useful but not required)

What you'll learn

  • Discover concepts of data catalogs including schema, metadata, data classification, business glossaries, tagging, data set endorsement, personally identifiable information (PII), sensitive data protection, regulatory compliance, data marketplaces, and more
  • Explore the role of machine learning and AI in catalog automation and relationship discovery
Photo of Andrew Brust

Andrew Brust

Blue Badge Insights | ZDNet

Andrew Brust is founder and CEO of Blue Badge Insights, a blogger for ZDNet Big Data, and is a data and analytics-focused analyst for GigaOm. He’s the coauthor of Programming Microsoft SQL Server 2012, a Microsoft tech influencer, and advises data and analytics ISVs on winning in the market, solution providers on their service offerings, and customers on their analytics strategy. Andrew is an entrepreneur, a consulting veteran, a former research director, and a current Microsoft Data Platform MVP.

  • Cloudera
  • O'Reilly
  • Google Cloud
  • IBM
  • Cisco
  • Dataiku
  • Intel
  • Io-Tahoe
  • MemSQL
  • Microsoft Azure
  • Oracle Cloud Infrastructure
  • SAS
  • Arcadia Data
  • BMC Software
  • Hazelcast
  • SAP
  • Amazon Web Services
  • Anaconda
  • Esri
  • Infoworks.io, Inc.
  • Kyligence
  • Pitney Bowes
  • Talend
  • Google Cloud
  • Confluent
  • DataStax
  • Dremio
  • Immuta
  • Impetus Technologies Inc.
  • Keyence
  • Kyvos Insights
  • StreamSets
  • Striim
  • Syncsort
  • SK holdings C&C

    Contact us

    confreg@oreilly.com

    For conference registration information and customer service

    partners@oreilly.com

    For more information on community discounts and trade opportunities with O’Reilly conferences

    strataconf@oreilly.com

    For information on exhibiting or sponsoring a conference

    pr@oreilly.com

    For media/analyst press inquires