Presented By O’Reilly and Intel AI
Put AI to work
Sep 4-5, 2018: Training
Sep 5-7, 2018: Tutorials & Conference
San Francisco, CA

Distributed deep learning in the cloud: Build an end-to-end application involving computer vision and geospatial data

Mary Wahl (Microsoft), Banibrata De (Microsoft)
9:00am-12:30pm Wednesday, September 5, 2018
Secondary topics:  Computer Vision, Deep Learning tools, Platforms and infrastructure

Who is this presentation for?

  • Data scientists and engineers, particularly those familiar with deep learning but are seeking guidance on scaling up distributed training on the cloud and handling the special challenges associated with geospatial data

Prerequisite knowledge

  • Experience with a deep learning framework (Microsoft’s Cognitive Toolkit not required)
  • A reading knowledge of Python

Materials or downloads needed in advance

  • A WiFi-enabled laptop
  • Download the required materials from the course GitHub repository or follow the included instructions to provision Azure Virtual Machines and Batch AI with your own account

What you'll learn

  • Understand how semantic segmentation models can be used to cheaply and quickly construct land cover maps to quantify trends like deforestation and urbanization, assisting in conservation efforts
  • Learn how to construct such models, using generally applicable skills like transitioning scripts to distributed training as well as those specific to the training data type and use case


Microsoft’s recently formed AI for Earth team helps NPOs apply AI to challenges in conservation biology and environmental science. Mary Wahl and Banibrata De highlight AI for Earth’s recent work with the Chesapeake Conservancy, a nonprofit organization charged with monitoring natural resources in the Chesapeake Bay watershed, a more than 165,000 square kilometer region in the eastern US extending from New York to Virginia. The Chesapeake Conservancy has meticulously assigned each square meter in this region a land cover label such as water, grass/herbaceous, tree/forest, or barren/impervious surface. The resulting high-quality, high-resolution land cover map is a valuable resource already used by environmental scientists and conservation efforts throughout the region. If similar maps could be generated at frequent timepoints, the data would allow researchers to quantify trends such as deforestation, urbanization, and the impacts of climate change. Unfortunately, the method used by Chesapeake Conservancy required substantial manual curation and postprocessing: the cost, time, and effort required severely limits how frequently the map can be updated.

In collaboration with the Chesapeake Conservancy and ESRI, AI for Earth’s team helped train a deep neural network model to predict land cover from a single high-resolution aerial imagery data source collected nationwide at frequent intervals. The team produced a neural network similar in architecture to Ronnenberger et al.’s U-Net, a commonly used semantic segmentation model type. After prototyping the training method in CNTK on a single-GPU Azure Geo AI Data Science Virtual Machine (DSVM) with a subset of available data, the team scaled up training to a 148-GPU cluster using Azure Batch AI. This transition allowed them to reduce the average duration of each training epoch 40-fold (near-linearity in scaling was lost with cluster sizes above 64 workers) and further reduced runtime by allowing the complete ~1 TB training dataset to be stored in memory rather than repeatedly accessed from disk. The trained model’s predictions are in excellent agreement with the conservancy’s land cover labels; many discrepancies are due to outdated information in the “ground truth” labels used during training.

Mary and Banibrata walk you through the end-to-end guide they published in early March describing their methods for training and applying this model. You’ll explore the use case and see the work in the context of the Azure resources in which it was developed. Along the way, Mary and Banibrata describe the source data and special challenges imposed by the very large file size and paired format, how the team handled common problems like label imbalance and partial missingness in training data, and the modifications required to enable distributed training in the script. They also highlight the speedups and other benefits achieved using data-parallel distributed training on a GPU cluster. They conclude by sharing how they surfaced this model for use in ArcGIS, a common geospatial information systems (GIS) software suite, so that its predictions can be shown in real time alongside aerial imagery as users scroll through a region of interest.

Photo of Mary Wahl

Mary Wahl


Mary Wahl is a data scientist on the AI for Earth team at Microsoft, which helps NGOs apply deep learning to problems in conservation biology and environmental science. Mary has also worked on computer vision and genomics projects as a member of Microsoft’s algorithms and data science solutions team in Boston. Previously, Mary studied recent human migration, disease risk estimation, and forensic reidentification using crowdsourced genomic and genealogical data as a Harvard College Fellow.

Photo of Banibrata De

Banibrata De


Banibrata De is a seasoned software engineer in Microsoft’s Algorithms and Data Science Group in Redmond, where he is the engineer for the Data Science Virtual Machine (DSVM) and works on a variety of solutions to help people democratize AI and ML using cutting-edge tools. Previously, he worked with the Windows Defender team to help protect Microsoft customers against various security vulnerabilities and was a performance software engineer for key Microsoft services and products helping to make the end-user experience enjoyable. He holds a degree in computer science from Jadavpur University, Kolkata, India.