Data science is a hot topic these days. Bart De Vylder breaks through the hype to provide a practical introduction to data analysis techniques applied to web performance data and related business metrics using Python.
Python is well known as a general purpose programming language. Moreover, the gap between interactive analysis and writing production-ready analytics code in Python is relatively small. Bart uses the iPython scientific ecosystem with NumPy, SciPy, and scikit-learn to make data analytics very approachable.
Drawing on a real-world anonymized dataset of monitoring data of backend server metrics, real-user monitoring data, and business metrics originating from an ecommerce website, Bart guides you through some techniques for visualizing (large) datasets, finding correlations between metrics (e.g., page load time versus conversion rate), applying machine-learning techniques to build a model of the data (e.g., which requests cause the most load on a server), anomaly detection, and forecasting of data. Bart ends with a challenging problem on the given dataset using one of the discussed techniques—with a nice prize for the attendee with the best solution.
Bart De Vylder is a data scientist at CoScale. Previously, Bart was active in software engineering and architecture, with a focus on distributed systems. His interests lie in machine learning and building reliable, scalable data processing systems. Bart holds a PhD in artificial intelligence from the Free University of Brussels.
©2016, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org