Presented By O'Reilly and Cloudera
Make Data Work
March 13–14, 2017: Training
March 14–16, 2017: Tutorials & Conference
San Jose, CA

The enterprise geospatial platform: A perfect fusion of cloud and open source technologies

Naghman Waheed (Bayer Crop Science), Martin Mendez-Costabel (Bayer Crop Science)
5:10pm–5:50pm Wednesday, March 15, 2017
Big data and the Cloud
Location: LL21 E/F Level: Intermediate
Secondary topics: Architecture, Cloud, Geospatial
Average rating: 4.00 (1 rating)

Who is this presentation for?

  • Managers, cloud architects, and engineers

Prerequisite knowledge

  • A basic understanding of geospatial data processing and cloud technologies (specifically AWS)

What you'll learn

  • Understand how to build a scalable and secure geospatial system in the cloud using open source technologies

Description

Geospatial datasets and systems were introduced at Monsanto over a decade ago, and their significance and use have only increased over time. Moreover, the volume and variety of geospatially tagged datasets being collected are increasing exponentially. However, the systems in use today have struggled to keep up with the ever-increasing demand. To address this, the Monsanto Data Platform Architecture and Engineering team set out to create a scalable geospatial platform in the cloud using only open source components. The result is a fully scalable geospatial platform that is used across the globe to process geospatial datasets for both visualization and analytics services.

Naghman Waheed and Martin Mendez-Costabel explain how Monsanto built this platform, focusing on the technical design and build of the entire system and covering the technical architecture, how and why the team chose certain open source components, and the lessons learned along the way. Naghman and Martin also highlight the value derived from the new platform through examples of how the system is being used to provide analytics on top of large geospatial datasets.

The entire platform was designed with several key architecture and engineering principles in mind: it needed to use open source, be instantiated in the AWS cloud, be easily scalable for both processing and storage needs, have automated monitoring and recovery from failure, and integrate with existing technologies such as API gateway and identity management. The platform also supports a pay-as-you-use model with spend visibility and accountability passed back to the user of the platform.

The platform was built using open source software, including CKAN as the data searching catalog, GeoServer as the geospatial processing engine, QGIS as the visualization tool, and S3, Amazon Elastic File System, PostGIS, and AWS ECS for data processing. The platform is fully integrated with AKAN and VDS (virtual directory service) and utilizes the OAuth 2.0 security model.
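As a rough illustration of how a client might talk to the components named above, the sketch below builds a CKAN catalog search URL, a GeoServer WMS map request URL, and an OAuth 2.0 bearer header. It is not the speakers' implementation: the host names, layer name, and helper function names are hypothetical, and only the CKAN Action API path (package_search) and the standard WMS GetMap parameters are taken from those projects' public interfaces.

```python
from urllib.parse import urlencode

# Hypothetical endpoints; the abstract does not publish real host names.
CKAN_BASE = "https://catalog.example.com"
GEOSERVER_BASE = "https://geoserver.example.com/geoserver"


def ckan_search_url(query, rows=10):
    """Build a CKAN Action API package_search URL for a free-text query."""
    params = urlencode({"q": query, "rows": rows})
    return f"{CKAN_BASE}/api/3/action/package_search?{params}"


def wms_getmap_url(layer, bbox, width=512, height=512, srs="EPSG:4326"):
    """Build a WMS 1.1.1 GetMap URL; bbox is (minx, miny, maxx, maxy)."""
    params = urlencode({
        "service": "WMS",
        "version": "1.1.1",
        "request": "GetMap",
        "layers": layer,
        "styles": "",
        "bbox": ",".join(str(v) for v in bbox),
        "width": width,
        "height": height,
        "srs": srs,
        "format": "image/png",
    })
    return f"{GEOSERVER_BASE}/wms?{params}"


def bearer_headers(token):
    """Return the HTTP header that carries an OAuth 2.0 bearer token."""
    return {"Authorization": f"Bearer {token}"}


if __name__ == "__main__":
    # "fields:yield_2016" is an illustrative layer name, not a real one.
    print(ckan_search_url("soil moisture"))
    print(wms_getmap_url("fields:yield_2016", (-91.0, 38.0, -90.0, 39.0)))
```

Keeping URL construction separate from the HTTP call means the same helpers can be used with any HTTP client, and the bearer header can be attached to both the catalog and the map request.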

Naghman Waheed

Bayer Crop Science

Naghman Waheed is the data platforms lead at Bayer Crop Science, where he’s responsible for defining and establishing enterprise architecture and direction for data platforms. Naghman is an experienced IT professional with over 25 years of work devoted to the delivery of data solutions spanning numerous business functions, including supply chain, manufacturing, order to cash, finance, and procurement. Throughout his 20+ year career at Bayer, Naghman has held a variety of positions in the data space, ranging from designing several scale data warehouses to defining a data strategy for the company and leading various data teams. His broad range of experience includes managing global IT data projects, establishing enterprise information architecture functions, defining enterprise architecture for SAP systems, and creating numerous information delivery solutions. Naghman holds a BA in computer science from Knox College, a BS in electrical engineering from Washington University, an MS in electrical engineering and computer science from the University of Illinois, and an MBA and a master’s degree in information management, both from Washington University.

Martin Mendez-Costabel

Bayer Crop Science

Martin Mendez-Costabel leads the geospatial data asset team for Monsanto’s Products and Engineering organization within the IT Department, where he drives the engineering and adoption of global geospatial data assets for the enterprise. He has more than 12 years of experience in the agricultural sector covering a wide range of precision agriculture-related roles, including data scientist and GIS manager for E&J Gallo Winery in California. Martin holds a BSc in agronomy from the National University of Uruguay and two viticulture degrees: an MSc from the University of California, Davis, and a PhD from the University of Adelaide in Australia.