Tools and practices for collaborative, reproducible data science

An introduction to “good enough” practices to create a shared virtual environment allowing you to develop reproducible workflows for your analysis and manage your data.

Julien Brun , Carrie Kappel , Julie Stewart Lowndes

Curriculum at a glance

This module is an introduction to the data science support NCEAS is providing to LTER and SNAPP working groups followed by a discussion on best practices about data management in a distributed team setup. Participants will have the opportunity to brainstorm on their data and computing needs. In the second part of the workshop, an introduction to the use of NCEAS analytical server and the concept of collaborative coding as a distributed team will be demonstrated to empower participants to develop their analytical workflows in a remote setup.

Workshop material:

Blog post:


If you see mistakes or want to suggest changes, please create an issue on the source repository.


Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".