Presented By O’Reilly and Cloudera

March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

Human in the loop: Bayesian rules enabling explainable AI

Pramit Choudhary (h2o.ai)

2:40pm–3:20pm Thursday, March 8, 2018

Data science and machine learning, Law, ethics, and governance
Location: LL20 C

Average rating: 5.00 (3 ratings)

Who is this presentation for?

Data scientists, data analysts, and machine learning practitioners

Prerequisite knowledge

A basic understanding of generative and discriminative models, machine learning, and statistical modeling

What you'll learn

Understand how model evaluation and interpretability relate to enterprise data challenges, and where Bayesian inference fits in
Learn best practices for understanding a model's behavior when building predictive modeling pipelines

Description

The adoption of machine learning to solve real-world problems has grown rapidly, but users still struggle to derive the full potential of their predictive models. It is no longer sufficient to judge a model’s predictions solely by error metrics computed on a validation set. Yet there is still a dichotomy between explainability and model performance when choosing an algorithm. Linear models and simple decision trees are often preferred over more complex models such as ensembles or deep learning models because they are easier to interpret, but this often results in a loss of accuracy. Is it actually necessary to accept a trade-off between model complexity and interpretability?

Pramit Choudhary explores the usefulness of a generative approach that applies Bayesian inference to generate human-interpretable decision sets in the form of “if...and else” statements. These human-interpretable decision lists with high posterior probabilities might be the right way to balance model interpretability, performance, and computation. This work extends DataScience.com’s ongoing effort to build trust in predictive algorithms and to drive better collaboration and communication among peers. Pramit also outlines DataScience.com’s open source model interpretation framework, Skater, and explains how it helps practitioners understand model behavior without compromising on the choice of algorithm.
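To make the idea of a decision list more concrete, here is a minimal Python sketch. It is not Skater's API and not necessarily the method presented in the talk. It assumes the ordering of the rules is already fixed by hand, and it only estimates each rule's outcome probability as the posterior mean of a Beta-Bernoulli model with a uniform Beta(1, 1) prior. A full Bayesian rule list would also search over candidate rules and their ordering. The `Rule` class, the helper functions, the feature names, and the toy data are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]


@dataclass
class Rule:
    """One 'if' clause of the list, with counts of the rows it captured."""
    description: str
    condition: Callable[[Row], bool]
    positives: int = 0
    total: int = 0

    def posterior_mean(self, alpha: float = 1.0, beta: float = 1.0) -> float:
        # Mean of Beta(alpha + positives, beta + negatives).
        return (self.positives + alpha) / (self.total + alpha + beta)


def fit_rule_list(rules: List[Rule], data: List[Tuple[Row, int]]) -> List[Rule]:
    """Assign each row to the first rule it satisfies and update that rule's counts."""
    for row, label in data:
        for rule in rules:
            if rule.condition(row):
                rule.total += 1
                rule.positives += label
                break
    return rules


def predict_proba(rules: List[Rule], row: Row) -> float:
    """Return P(y = 1) from the first rule that matches the row."""
    for rule in rules:
        if rule.condition(row):
            return rule.posterior_mean()
    raise ValueError("the rule list needs a catch-all default rule")


def describe(rules: List[Rule]) -> str:
    """Render the fitted list as readable if / else if statements."""
    lines = []
    for i, rule in enumerate(rules):
        keyword = "if" if i == 0 else "else if"
        lines.append(f"{keyword} {rule.description}: P(y=1) = {rule.posterior_mean():.2f}")
    return "\n".join(lines)


# Hypothetical toy data: (features, binary label).
data = [
    ({"age": 25, "income": 20}, 1),
    ({"age": 22, "income": 60}, 1),
    ({"age": 28, "income": 30}, 0),
    ({"age": 45, "income": 80}, 1),
    ({"age": 50, "income": 70}, 1),
    ({"age": 40, "income": 20}, 0),
    ({"age": 60, "income": 10}, 0),
]

# Hand-written rules in a fixed order; the last one is the default.
rules = fit_rule_list(
    [
        Rule("age < 30", lambda r: r["age"] < 30),
        Rule("income > 50", lambda r: r["income"] > 50),
        Rule("anything else", lambda r: True),
    ],
    data,
)

print(describe(rules))
```

Because every prediction comes from the first matching rule, the printed list is itself the explanation of the model, which is the property that makes decision lists attractive when interpretability matters.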

Pramit Choudhary

h2o.ai

Pramit Choudhary is a lead data scientist and ML scientist at h2o.ai, where he focuses on optimizing and applying classical machine learning and Bayesian design strategies to solve large-scale real-world problems.
Currently, he is leading initiatives to find better ways to express a predictive model’s learned decision policies as meaningful insights for both supervised and unsupervised problems.

Comments

Pramit Choudhary | LEAD DATA SCIENTIST

03/10/2018 2:56am PST

Thanks for joining the talk, everyone. Feel free to reach out if you have questions or suggestions.
Check out Skater.
I will post the slides from the presentation soon.

©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com