Skip to main content

Information Visualization for Large-Scale Data Workflows

Michael Conover (LinkedIn)
GA Ballroom K
Average rating: ****.
(4.42, 12 ratings)

The ability to instrument and interrogate data as it moves through a processing pipeline is fundamental to effective machine learning at scale. Applied in this capacity, information visualization technologies drive product innovation, shorten iteration cycles, reduce uncertainty, and ultimately improve the performance of predictive models. It can be challenging, however, to understand where in a workflow to employ data visualization, and, once committed to doing so, developing revealing visualizations that suggest clear next steps can be similarly daunting.

In this talk we’ll describe the role that information visualization technologies play in the LinkedIn data science ecosystem, and explore best practices for understanding the structure of large-scale data in a production environment. From hypothesis generation and feature development to model evaluation and tooling, visualization is at the heart of LinkedIn’s machine learning workflows, enabling our data scientists to reason and communicate more effectively. Broken down into clear, structured insights based on proven technology and workflow patterns, this talk will help you understand how to apply information visualization to the analytical challenges you encounter every day.

Photo of Michael Conover

Michael Conover

Staff Data Scientist, LinkedIn

Mike Conover builds machine-learning technologies that leverage the behavior and relationships of hundreds of millions of people. A staff data scientist at LinkedIn, Mike has a PhD in complex systems analysis with a focus on information propagation in large-scale social networks. His work has appeared in the New York Times, the Wall Street Journal, Science, and the MIT Technology Review as well as on National Public Radio.