Published September 9, 2019 | Version v1
Presentation Open

The Turing Way: A Handbook for Reproducible Data Science

  • 1. University of Manchester

Description

Demo presentation of the Turing Way at the 2019 Open Science Fair.

Abstract:

The Turing Way is a handbook to support students, their supervisors, funders and journal editors in ensuring that reproducible data science is "too easy not to do" (https://the-turing-way.netlify.com). It includes training material on topics such as version control and analysis testing, and will build upon Alan Turing Institute case studies and workshops. The project also demonstrates open and transparent project management and communication with future users, as it is openly developed at our GitHub repository: https://github.com/alan-turing-institute/the-turing-way. All resources associated with workshops we have delivered, as well as how to organise a Book Dash (a one-day book sprint), are also openly available.

Reproducible research is necessary to ensure that scientific work can be trusted. Funders and publishers are beginning to require that publications include access to the underlying data and the analysis code. The goal is to ensure that all results can be independently verified and built upon in future work, which is sometimes easier said than done. Sharing these research outputs means understanding data management, library sciences, software development, and continuous integration techniques: skills that are not widely taught or expected of academic researchers and data scientists.

During this session, we will lead a collaborative review of the handbook so far and show Open Science Fair participants how they can contribute their knowledge to make it even better going forwards or how to open up their own projects to a wider contributor community. This demo relates to the overall theme of the conference, as the Turing Way provides the tools to improve research habits in a self-contained handbook. It will also ensure that PhD students, postdocs, PIs and funding teams know which parts of the "responsibility of reproducibility" they can affect, and what they should do to nudge research and data science to being more efficient, effective and understandable.

Files

Ainsworth_TuringWayDemo_OSF19.pdf

Files (23.4 MB)

Name Size Download all
md5:e0b3843c4376828bb08e59fc71fc623e
1.3 MB Preview Download
md5:7bec4c54f662c96705cdf99540d577c2
22.1 MB Download