Reinforcement learning (RL) learns complex processes autonomously like walking, beating the world champion in Go, or flying a helicopter. No big datasets with the “right” answers are needed: the algorithms learn by experimenting.
Christian Hidber shows how and why RL works and demonstrates how to apply it to an industrial hydraulics application with 7,000 clients in 42 countries—illustrating the inner workings by the way a child learns to play a new game. The industrial application stems from siphonic roof drainage systems, which ensure that large buildings like stadiums, airports, and shopping malls do not collapse during heavy rainfalls. Choosing the “right” diameters is difficult, requiring intuition and hydraulic expertise. As of today no feasible, deterministic algorithm is known. Christian explains how using reinforcement learning, he and his team were able to reduce the failure rate of the existing solution (based on classic supervised learning) by more than 70%.
Christian Hidber is a software engineer at bSquare, where he applies machine learning to industrial hydraulics simulation, part of a product with 7,000 installations in 42 countries. He holds a PhD in computer algebra from ETH Zurich, which he followed with a postdoc at UC Berkeley, where he researched online data mining algorithms.
For exhibition and sponsorship opportunities, email strataconf@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata Data Conference contacts
©2019, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com