Poster Open Access

The Turing Way: A Handbook for Reproducible Data Science

The Turing Way Community; Ainsworth, Rachael; Arnold, Becky; Bowler, Louise; Gibson, Sarah; Herterich, Patricia; Higman, Rosie; Krystalli, Anna; Morley, Alexander; O'Reilly, Martin; Whitaker, Kirstie

Poster presentation of the Turing Way at the 2019 Open Science Fair.

Abstract:

The Turing Way is a handbook to support students, their supervisors, funders and journal editors in ensuring that reproducible data science is "too easy not to do" (https://the-turing-way.netlify.com). It includes training material on topics such as version control and analysis testing, and will build upon Alan Turing Institute case studies and workshops. The project also demonstrates open and transparent project management and communication with future users, as it is openly developed at our GitHub repository: https://github.com/alan-turing-institute/the-turing-way. All resources associated with workshops we have delivered, as well as how to organise a Book Dash (a one-day book sprint), are also openly available.

Reproducible research is necessary to ensure that scientific work can be trusted. Funders and publishers are beginning to require that publications include access to the underlying data and the analysis code. The goal is to ensure that all results can be independently verified and built upon in future work, which is sometimes easier said than done. Sharing these research outputs means understanding data management, library sciences, software development, and continuous integration techniques: skills that are not widely taught or expected of academic researchers and data scientists.

This poster will present an overview of the handbook so far and show Open Science Fair participants how they can contribute their knowledge to improve it, or how to open up their own projects to a wider contributor community. The poster relates to the overall theme of the conference, as the Turing Way provides the tools to improve research habits in a self-contained handbook. It will also ensure that PhD students, postdocs, PIs and funding teams know which parts of the "responsibility of reproducibility" they can affect, and what they should do to nudge research and data science towards being more efficient, effective and understandable.

Files (12.6 MB)
Name Size Checksum
Ainsworth_TuringWayPoster_OSF19.pdf 6.2 MB md5:b3ab20bb2a75ac0567d015b63359c72b
Ainsworth_TuringWayPoster_OSF19.pptx 6.3 MB md5:f2901d1b499f184b50a6a9216674487b