Brought to you by NumFOCUS Foundation and O’Reilly Media
The official Jupyter Conference
Aug 21-22, 2018: Training
Aug 22-24, 2018: Tutorials & Conference
New York, NY

In-Person Training
Reproducible research best practices (highlighting Kaggle Kernels)

Rachael Tatman (Kaggle)
9:00am–5:00pm Tuesday, August 21, 2018
1-Day Training Location: Concourse F
Average rating: *****
(5.00, 1 rating)

This is a 1-Day training course. Participants should plan to attend training courses on both Tuesday and Wednesday. To attend, you must register for a Platinum pass; does not include access to tutorials on Wednesday.

Rachael Tatman shows you how to take an existing research project and make it fully reproducible using Kaggle Kernels. You'll learn best practices for and get hands-on experience with each of the three components necessary for completely reproducible research.

What you'll learn, and how you can apply it

  • Learn how to make your research projects fully reproducible using Kaggle Kernels

This training is for you because...

  • You're a researcher who wants to ensure your work is fully reproducible.

Prerequisites:

  • A working knowledge of Python or R
  • A Kaggle account

Everyone seems to be talking about reproducible research, but how do you actually make sure that your work actually is fully reproducible? Rachael Tatman shows you how to take an existing research project (either your own or a provided example) and make it fully reproducible using Kaggle Kernels. You’ll learn best practices for and get hands-on experience with each of the three components necessary for completely reproducible research:

  • Data: Format and document your data for easy use and reuse
  • Code: Make your code easy for other people (or you in the future) to run and understand
  • Computing environment: Standardize the computing environment your code is run in to ensure consistent output

About your instructor

Photo of Rachael Tatman

Rachael Tatman is a data scientist at Kaggle. She holds a PhD in linguistics from the University of Washington, with a focus in computational sociolinguistics. Her interests include data science education and fairness in machine learning.

Twitter for rctatman

Conference registration

Get the Platinum pass or the Training pass to add this course to your package.

Comments on this page are now closed.

Comments

Rachael Tatman |
08/22/2018 5:20am EDT

Here’s a link to the hosted notebook we used in the course: https://www.kaggle.com/rtatman/reproducible-research-best-practices-jupytercon