Biological data, like other data collected from complex systems, can have tens of thousands of variables that interact nonlinearly. Interactive visualizations help users develop an intuition for the global structure of their data and quickly identify patterns. Dimensionality reduction techniques are useful for obtaining a bird's-eye view of data, but they often obscure important information. Heatmaps, or clustergrams, are a powerful complementary technique for directly visualizing all variables in high-dimensional data. While many software tools can generate clustergrams, few are web-based, fully interactive, or seamlessly integrated into Jupyter notebooks.
Nicolas Fernandez offers an overview of Clustergrammer-Widget, which enables users to easily visualize high-dimensional data (e.g., a pandas DataFrame) within a Jupyter notebook as an interactive hierarchically clustered heatmap. Clustergrammer-Widget generates highly interactive visualizations (e.g., reorderable and zoomable) that can be embedded within notebooks and shared using nbviewer. Clustergrammer-Widget was developed to analyze high-dimensional biological data but can be applied to high-dimensional data from any field. Nicolas explains how to use Jupyter notebooks and Clustergrammer-Widget to produce transparent and reproducible analyses for a wide variety of biological datasets and demonstrates how to share your results with collaborators.
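To make the idea of a hierarchically clustered heatmap concrete, the following is a minimal, dependency-free sketch of the reordering step behind a clustergram: rows and columns are each clustered, and the matrix is rearranged so that similar rows and columns sit next to each other. It is not Clustergrammer-Widget's implementation or API. The function names (`cluster_order`, `clustergram_order`), the choice of Euclidean distance with average linkage, and the small example matrix are all assumptions made for illustration.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def cluster_order(vectors):
    """Return a leaf order from naive average-linkage agglomerative clustering.

    Hypothetical helper for illustration only; real tools use optimized
    library routines rather than this quadratic search.
    """
    # Each cluster is a list of original indices, kept in display order.
    clusters = [[i] for i in range(len(vectors))]

    def avg_distance(a, b):
        # Mean pairwise distance between the members of two clusters.
        total = sum(dist(vectors[i], vectors[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        # Find the closest pair of clusters and merge them.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: avg_distance(clusters[p[0]], clusters[p[1]]),
        )
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return clusters[0] if clusters else []


def clustergram_order(matrix):
    """Cluster rows and columns, then return both orders and the reordered matrix."""
    rows = cluster_order(matrix)
    cols = cluster_order([list(col) for col in zip(*matrix)])  # transpose
    reordered = [[matrix[r][c] for c in cols] for r in rows]
    return rows, cols, reordered


# Made-up example: rows 0 and 2 resemble each other, as do rows 1 and 3.
matrix = [
    [9.0, 1.0, 8.5, 0.5],
    [1.0, 9.0, 0.5, 8.0],
    [8.0, 0.5, 9.0, 1.0],
    [0.5, 8.5, 1.0, 9.0],
]
rows, cols, reordered = clustergram_order(matrix)
for row in reordered:
    print(row)
```

After reordering, the high values gather into blocks, which is the pattern a clustergram makes visible. An interactive tool adds zooming and alternative orderings on top of this basic rearrangement.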
Nicolas Fernandez is a computational scientist at the Human Immune Monitoring Center at the Icahn School of Medicine at Mount Sinai. Nicolas is a computational biologist interested in the analysis and visualization of high-throughput biological data as a means of understanding biological regulatory networks.
©2018, O'Reilly Media, Inc.