It is widely accepted that where you live says a lot about who you are, demographically speaking. At the same time, many companies are desperate to find out more about their customers in order to better understand them. By knowing where they live however, many companies are sitting on an extremely rich dataset from which they could learn a lot about their customers. Furthermore, this data can be used to optimize their marketing strategy and help them expand their customer base.
Gary Willis offers a technical presentation of a novel algorithm to help companies leverage locational data they have on their clients. The technique enriches a customer dataset using UK census data and then applies a novel, tree-based unsupervised learning algorithm to extract differentiating demographic features, making it possible to identify high-value postcodes without performing anomaly detection on the entirety of the UK population.
Along the way, Gary also discusses a wide range of further potential applications with census data and other datasets. For instance, fires or A&E admissions are relatively rare events where one would like to avoid having to perform anomaly detection on the entire UK population or all UK households.
Gary Willis is a data scientist at ASI with a diverse background in applying machine-learning techniques to commercial data science problems. Gary holds a PhD in statistical physics; his research looked at Markov Chain Monte Carlo simulations of complex systems.
©2017, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com