Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published November 16, 2019 | Version v1
Presentation Open

The Turing Way: Reproducible, Inclusive, Collaborative Data Science

  • 1. The Alan Turing Institute

Description

Slides from Kirstie's keynote at PyData Cambridge on 16 November 2019

Abstract: Reproducible research is necessary to ensure that scientific work can be trusted. By sharing data, analysis code and the computational environment used to generate the results, researchers can more effectively stand on the shoulders of their peers and colleagues and deliver high quality, trustworthy and verifiable outputs. This requires skills in data management, library sciences, software development, and continuous integration techniques: skills that are not widely taught or expected of academic researchers. Skills that are unreasonable, in fact, to expect in one individual team member. Even worse, they are not sufficient for ethical, transparent, collaborative, participatory and well designed data science! The Turing Way is a handbook to support students, their supervisors, industry data scientists, team leaders, funders, journal editors, and policy makers in ensuring that reliable and impactful data science is "too easy not to do". It includes training material on version control, analysis testing, collaborating in distributed groups, open and transparent communication skills, and effective management of diverse research projects. The Turing Way is openly developed and any and all questions, comments and recommendations are welcome at our github repository: https://github.com/alan-turing-institute/the-turing-way. In this talk, Kirstie Whitaker, lead developer of The Turing Way, will take you on a whirlwind tour of the chapters that already exist and the directions in which we're continuing to develop including ethical considerations,research project design, scoping across a broad range of incentives and ways of working, and effective communication strategies. All participants will leave the talk knowing that "Every Little Helps" when making their work reproducible, where to ask for help as they start or continue their open research journey, and how they can contribute to improve The Turing Way for future readers.

Bio: Kirstie Whitaker is a research fellow at the Alan Turing Institute (London, UK) and senior research associate in the Department of Psychiatry at the University of Cambridge. Her work covers a broad range of interests and methods, but the driving principle is to improve the lives of neurodivergent people and people with mental health conditions. Dr Whitaker uses magnetic resonance imaging to study child and adolescent brain development and participatory citizen science to educate non-autistic people about how they can better support autistic friends and colleagues. She is the lead developer of The Turing Way, an openly developed educational resource to enable more reproducible data science. Kirstie is a passionate advocate for making science "open for all" by promoting equity and inclusion for people from diverse backgrounds, and by changing the academic incentive structure to reward collaborative working. She is the chair of the Turing Institute's Ethics Advisory Group, a Fulbright scholarship alumna and was a 2016/17 Mozilla Fellow for Science. Kirstie was named, with her collaborator Petra Vertes, as a 2016 Global Thinker by Foreign Policy magazine. You can find more information at her lab website: whitakerlab.github.io.

Useful links

Notes

This work was supported by The UKRI Strategic Priorities Fund under the EPSRC Grant EP/T001569/1, particularly the "Tools, Practices and Systems" theme within that grant, and by The Alan Turing Institute under the EPSRC grant EP/N510129/1.

Files

Whitaker_PyDataCambridge_November2019.pdf

Files (88.6 MB)

Name Size Download all
md5:8b1adb36d1892a104d5edf3602b60d50
2.7 MB Preview Download
md5:105357656977e25291d3ddb9b7e6532b
85.9 MB Download