Brought to you by NumFOCUS Foundation and O’Reilly Media
The official Jupyter Conference
Aug 21-22, 2018: Training
Aug 22-24, 2018: Tutorials & Conference
New York, NY

Using Jupyter to create a community for satellite imagery analysis and sharing

Sean Gorman (DigitalGlobe)
4:10pm–4:50pm Friday, August 24, 2018

Who is this presentation for?

  • Data scientists, product managers, community builders, and enterprise architects

Prerequisite knowledge

  • Familiarity with the Jupyter ecosystem
  • A basic understanding of Python (useful but not required)

What you'll learn

  • Learn how to leverage the Jupyter ecosystem to develop custom community and enterprise products
  • Understand best practices for building communities of interest around Jupyter, along with the challenges of integrating large data repositories with a Jupyter stack and ways to address them
  • Discover creative approaches to onboarding new users to Jupyter with sequenced clone-and-edit tutorials

Description

Satellite imagery can be a critical resource during disasters and humanitarian crises. While the community has improved data sharing, we still struggle to create reusable data science to solve problems on the ground.

Sean Gorman offers an overview of GBDX Notebooks, a step toward creating an open data science community built around Jupyter to stream imagery and share analysis at scale. Sean explains how the GBDX team leveraged and extended Jupyter to create a platform for satellite imagery analysis and community collaboration, and how they built buy-in for Jupyter and facilitated open source work and contributions in a corporate, proprietary setting. Sean then shares lessons learned and provides a template for how Jupyter can be leveraged to create unique products for both humanitarian and commercial purposes.

Topics include:

  • Designing a community hub interface for discovering notebooks and tutorials
  • Managing the integration of a 17 PB data archive into the Jupyter interface
  • Creating custom interfaces with React in Jupyter notebooks to discover satellite imagery
  • Injecting dynamically created satellite imagery products as ndarrays
  • Leveraging Dask for deferred compute of large satellite images
  • Publishing notebooks and access controls
  • Managing access to both open and proprietary data sources
  • Onboarding users to new data sources and applied methods with notebooks

Sean Gorman

DigitalGlobe

Sean Gorman is the head of technical product management at DigitalGlobe. Previously, Sean was a cofounder of Timbr.io, a platform for enabling algorithmic orchestrations with sensor and social data (acquired by DigitalGlobe), and the founder and CEO of GeoIQ, a collaborative data and analytics company serving commercial and government customers (acquired by Esri). Sean also worked at Esri integrating social data with Esri’s mapping technologies and was a research professor at George Mason University, where he focused on the intersection of complexity science, statistical mechanics, and spatial analysis. Sean holds a PhD from George Mason University, where he was the Provost’s High Potential Research Candidate, a Fisher Prize winner, and an INFORMS Dissertation Prize recipient.