Everything open source
May 16–17, 2016: Training & Tutorials
May 18–19, 2016: Conference
Austin, TX

Navigating the data science Python ecosystem

Christine Doig (Continuum Analytics)
4:20pm–5:00pm Thursday, 05/19/2016
Data
Location: Ballroom F
Level: Intermediate
Average rating: 4.50 (6 ratings)

Prerequisite knowledge

Attendees should be familiar with Python and basic data science concepts.

Description

Do you feel lost with terms like data science, machine learning, deep learning, neural networks, or supervised learning? Have you tried to start a data science project, but felt overwhelmed with all the libraries? Are you having trouble figuring out how to proceed? Have you heard of scikit-learn, Theano, Dask, xarray, Blaze, gensim, Bokeh, PyMC3, Numba, and Jupyter but don’t know what each library is for? This talk is for you.

Christine Doig explores data science and the state of the Python ecosystem and helps you navigate the large number of open source libraries available for data science in Python, providing a map to guide you on the journey. You’ll learn what data science is and discover existing libraries, their functionalities, and applications.

The talk will be divided into three sections:

  1. Introduction to data science: Christine revisits the definition of a data scientist, analyzes whether it is reasonable to define the role by a set of skills, and explores what it takes in practice to deliver a successful data science project.
  2. The state of Python for data science: Christine outlines the big picture of the Python ecosystem to lower the barrier to entry and help draw the connection between libraries and features.
  3. From data to models to applications: Christine walks you through a complete workflow for doing data science, from data collection and development environment setup to data modeling and building a data application, all without leaving the Python programming language.

Christine Doig

Continuum Analytics

Christine Doig is a data scientist at Continuum Analytics. Christine loves Python and sharing her open source findings with others. She has taught tutorials and presented many talks on data science and Python libraries like conda, Blaze, Bokeh, and scikit-learn at EuroPython, PyTexas, PyGotham, PyCon Spain, PyData (Dallas, Berlin), SciPy, and local meetup groups. In her free time, Christine loves to travel and tweet.