Analyzing Data with Python

Sarah Guido (InVision)

Python is quickly becoming the go-to language for data analysis, but with so many tools available, it can be overwhelming for those who are new to analyzing data in Python. In this presentation, I’ll discuss several of the best tools for working with data, how to structure a data analysis workflow, and which tools are appropriate for handling different kinds of data. You’ll leave with a good understanding of different data analysis techniques in Python and some ideas to try on your own.

I’ll show you examples of each of the following:

  • Data preprocessing and data wrangling with Pandas
  • Using Scikit-Learn for machine learning
  • Using the Natural Language Toolkit for natural language processing
  • Running MapReduce jobs with MRJob
  • Visualizing our results with matplotlib
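
To give a flavor of the first item, here is a minimal sketch of a preprocessing and wrangling step, assuming pandas is installed. The column names and values are made up for illustration and are not taken from the talk.

```python
import pandas as pd

# Hypothetical sample data (an assumption for this sketch, not from the talk).
raw = pd.DataFrame(
    {
        "user": ["ann", "bob", "ann", "cat", "bob"],
        "minutes": [12.0, None, 30.0, 7.5, 18.0],
    }
)

# Preprocessing: drop rows where the measurement is missing.
clean = raw.dropna(subset=["minutes"])

# Wrangling: total minutes per user, largest total first.
totals = clean.groupby("user")["minutes"].sum().sort_values(ascending=False)

print(totals)
```

A cleaned, aggregated result like `totals` is the kind of input that the later steps in the list, such as modeling or plotting, would typically start from.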

Sarah Guido


Sarah Guido is a senior data scientist at InVision, where she studies user collaboration through data. An accomplished conference speaker and O’Reilly author, she enjoys making data science as accessible as possible to a broad audience. Sarah attended graduate school at the University of Michigan’s School of Information.