Mar 15–18, 2020

Evaluation of traditional and novel feature selection approaches

Ben Fowler (Southeast Toyota Finance)
4:15pm4:55pm Tuesday, March 17, 2020
Location: 210 C/G

Who is this presentation for?

  • Data scientists and data scientist managers




A central component to the machine learning process is feature selection. Selecting the optimal set of features is important to generate a best fit model that generalizes to unseen data. A widely used approach for feature selection involves calculating Gini Importance (Gain) to identify the best set of features.

However, recent work from Scott Lundberg has found challenges with the consistency of the Gain attribution method. Ben Fowler shares the results of model metrics on the Lending Club dataset, testing five different feature selection approaches. The approaches tested involved widely used approaches combined with novel approaches for feature selection.

You’ll discover the impact of the data splitting method, including relevant two-way and three-way interactions (xgbfir library), backwards stepwise feature selection as opposed to a singular feature selection step, and backwards stepwise feature selection using Shapley values (shap library).

Prerequisite knowledge

  • A high-level understanding of the modeling process
  • Experience building machine learning models (useful but not required)
  • What you'll learn

    • Gain added predictive power and velocity to the feature selection process
    Photo of Ben Fowler

    Ben Fowler

    Southeast Toyota Finance

    Ben Fowler is a machine learning technical leader at Southeast Toyota Finance, where he leads the end-to-end model development. He’s been in the field of data science for over five years. Ben has been a guest speaker at the Southern Methodist University program multiple times. Additionally, he’s spoken at the PyData Miami 2019 Conference and has spoken multiple times at the West Palm Beach Data Science Meetup. He earned a master of science in data science from Southern Methodist University.

    Leave a Comment or Question

    Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

    Join the conversation here (requires login)

    Contact us

    For conference registration information and customer service

    For more information on community discounts and trade opportunities with O’Reilly conferences

    Become a sponsor

    For information on exhibiting or sponsoring a conference

    For media/analyst press inquires