Data science and the business of Major League Baseball





Who is this presentation for?
- Business stakeholders, strategy and analytics leadership, data scientists, data analysts, data engineers, business intelligence analysts, and business intelligence developers
Level
Description
When most people think of MLB and data science, they think of Moneyball, sabermetrics, or Statcast AI—data science from action on the field. However, with millions of local and global fans engaging with America’s pastime every day, and with 30 client organizations (i.e., each MLB club), there’s also a great deal of action happening off the field. From ticket and apparel purchases to live game streaming and app check-ins to email activity and clickstreams, MLB leverages data science to better serve its fans and clubs.
Matt Horton, Josh Hamilton, and Aaron Owen offer an overview of some of the many projects that MLB’s data science team undertakes. Incorporating tools such as SAS, Python, and AWS SageMaker, these projects include predicting ticket purchasers’ likelihood to purchase again, evaluating prospective season schedules, estimating customer lifetime value, home-game promotion optimization, quantifying the strength of fan avidity, and monitoring the health of monthly subscribers of MLB’s game-streaming service.
Prerequisite knowledge
- A basic understanding of statistics and data science concepts
What you'll learn
- Discover how MLB leverages data science and analytics to understand and serve its fans and clubs

Aaron Owen
Major League Baseball
Aaron Owen is a data scientist at Major League Baseball, where he leverages his skills to solve business problems for the organization and its 30 teams. Aaron holds an MS and PhD in evolutionary biology and was previously a professor at both the City University of New York and New York University.

Matthew Horton
Major League Baseball
Matt Horton is the senior director of data science at Major League Baseball (MLB). In his 11+ years at MLB, Matt has developed numerous projects including predicting ticket buyers’ future purchasing behavior to aid teams in prioritizing their marketing efforts and building a framework for predicting and preventing subscriber churn for MLB’s game-streaming service, MLB.TV. Matt’s team is focused on quantifying fans’ relationships with their favorite teams, modeling trends in both team and league-wide attendance, and estimating fans’ future engagement with MLB. Previously, Matt was at Rosetta and Accenture. He has a BS in statistics from the University of Tennessee and a master’s in applied statistics from Cornell University.
In addition to him being a huge baseball fan, Matt is also an avid fan of sports in general, rooting for teams from his home state of Tennessee, including the Volunteers, Titans, Predators, and Grizzlies.

Josh Hamilton
Major League Baseball
Josh Hamilton is a data scientist at Major League Baseball, where he works with the league and its 30 teams to build data pipelines and predictive models. Previously, Josh helped build data infrastructure and recommender systems for a movie streaming company and worked as a product manager for a platform-as-a-service startup. He studied finance and economics and holds an MS in applied statistics from the University of Alabama.
Presented by
Elite Sponsors
Strategic Sponsors
Zettabyte Sponsors
Contributing Sponsors
Exabyte Sponsors
Content Sponsor
Impact Sponsors
Supporting Sponsor
Non Profit
Contact us
confreg@oreilly.com
For conference registration information and customer service
partners@oreilly.com
For more information on community discounts and trade opportunities with O’Reilly conferences
strataconf@oreilly.com
For information on exhibiting or sponsoring a conference
pr@oreilly.com
For media/analyst press inquires