Published December 12, 2023 | Version v1
Presentation Open

Participatory data stewardship in AI


The final Data Futures Lab Community Call in 2023 is tomorrow, Tuesday, December 12! Anne Lee Steele and Jennifer Ding of the Alan Turing Institute will talk about open, participatory approaches to data stewardship, drawing from their experiences working with projects like BigScience, BigCode, and The Turing Way.

BigScience is by now, a well known research initiative that created the BLOOM (BigScience Large Open-science Open-access Multilingual) Language Model. BigCode is the next iteration of BigScience, focused on code-generating Large Language Models. The data and models created from these projects are hosted by, and in partnership with HuggingFace.

The Turing Way is a distributed community of researchers and practitioners from data science related fields who actively contribute to a handbook of tools and best practices to ensure that conducting open, responsible, localised, and collaborative data science is "too easy not to do." I personally reference it all the time, especially to talk about the nuances in open licensing as it relates to AI, from models, to datasets, to traditional software licenses.

The call is at 11am EST / 4pm BST / 5pm CET. Sign up here to attend! In the meantime, you can learn more about the projects from the links above and take a look at this paper. We look forward to seeing you there.



Files (11.1 MB)

Name Size Download all
11.1 MB Preview Download