Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

Topic modeling openNASA data

Noemi Derzsy (Rensselaer Polytechnic Institute)
4:35pm5:15pm Thursday, September 28, 2017
Data science & advanced analytics
Location: 1E 15/16 Level: Intermediate
Secondary topics:  Text

Who is this presentation for?

  • Data scientists and analysts, government employees, and anyone interested in open government data

Prerequisite knowledge

  • Basic familiarity with Python

What you'll learn

  • Explore NASA's open dataset structure and content
  • Learn topic modeling techniques to understand different government data associations


Open source data has enabled society to engage in community-based research and has provided government agencies with more visibility and trust from individuals. Noemi Derzsy offers an overview of the openNASA platform (NASA’s 32,000+ open dataset collection) and discusses openNASA metadata analysis and tools for applying NLP and topic modeling techniques to understand open government dataset associations.

Photo of Noemi Derzsy

Noemi Derzsy

Rensselaer Polytechnic Institute

Noemi Derzsy is a postdoctoral research associate at the Social Cognitive Network Academic Research Center at Rensselaer Polytechnic Institute, where she uses data sets to analyze, understand, and model complex systems using network science and data science techniques. She’s also a NASA datanaut. Noemi holds a PhD in physics.