Sep 23–26, 2019

Search Logs + Machine Learning = Auto-Tagged Inventory

John Berryman (Eventbrite)
4:35pm5:15pm Wednesday, September 25, 2019
Location: 3B - Expo Hall
Secondary topics:  Text and Language processing and analysis

Who is this presentation for?

data-driven application engineers, search technologists, taxonomists, e-commerce




For e-commerce applications, matching users with the items they want is the name of the game. If they can’t find what they want then how can they buy anything?! Typically this functionality is provided through search and browse experience. Search allows users to type in text and match against the text of the items in the inventory. Browse allows users to select filters and slice-and-dice the inventory down to the subset they are interested in. But with the shift toward mobile devices, no one wants to type anymore – thus browse is becoming dominant in the e-commerce experience.

But there’s a problem! What if your inventory is not categorized? Perhaps your inventory is user generated or generated by external providers who don’t tag and categorize the inventory. No categories and no tags means no browse experience and missed sales. You could hire an army of taxonomists and curators to tag items – but training and curation will be expensive. You can demand that your providers tag their items and adhere to your taxonomy – but providers will buck this new requirement unless they see obvious and immediate benefit. Worse, providers might use tags to game the system – artificially placing themselves in the wrong category to drive more sales. Worst of all, creating the right taxonomy is hard. You have to structure a taxonomy to realistically represent how your customers think about the inventory.

Eventbrite is investigating a tantalizing alternative: using a combination of customer interactions and machine learning to automatically tag and categorize our inventory. As customers interact with our platform – as they search for events and click on and purchase events that interest them – we implicitly gather information about how our users think about our inventory. Search text effectively acts like a tag and a click on an event card is a vote for that clicked event is representative of that tag. We are able to use this stream of information as training data for a machine learning classification model; and as we receive new inventory, we can automatically tag it with the text that customers will likely use when searching for it. This makes it possible to better understand our inventory, our supply and demand, and most importantly this allows us to build the browse experience that customers demand.

In this talk I will explain in depth the problem space and Eventbrite’s approach in solving the problem. I will describe how we gathered training data from our search and click logs, and how we built and refined the model. I will present the output of the model and discuss both the positive results of our work as well as the work left to be done. Those attending this talk will leave with some new ideas to take back to their own business.

Prerequisite knowledge

Basic machine learning involving text manipulation, classification algorithms, and neural networks.

What you'll learn

Attendees will come away with a clever technique for generating tags for products based upon the search behavior of customers.
Photo of John Berryman

John Berryman


John Berryman started out in the field of Aerospace Engineering but soon found that he was more interested in math and software than in satellites and aircraft. He soon made the leap into software development specializing in search and recommendation technologies. John is now a Senior Software Engineer at Eventbrite, where he is helping build Eventbrite’s event discovery platform. He also recently coauthored a tech book, Relevant Search, published by Manning. The proceeds from the book have mostly paid for the coffee consumed while writing it.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)

Contact us

For conference registration information and customer service

For more information on community discounts and trade opportunities with O’Reilly conferences

For information on exhibiting or sponsoring a conference

Contact list

View a complete list of Strata Data Conference contacts