Published December 7, 2020 | Version v1
Report Open

The Role of Data in AI


This is the report from the project The Role of Data in AI, which was commissioned by the Data Governance Working Group of the Global Partnership on AI (GPAI). The consortium was led by the Digital Curation Centre, with partners Trilateral Research and the School of Informatics at the University of Edinburgh. The report examines in greater depth the issues raised within the Data Governance Framework (see below) and identifies areas where GPAI could make an impact in deepening international collaboration.

It covers the following areas:

  • AI development and the role of data at each step;
  • Data types used in AI development;
  • Data characteristics that influence the process or outcome of AI development;
  • Socio-ethical, economic and environmental impacts of data in AI;
  • Law and transparency as modifiers to impacts of data in AI;
  • Availability and accessibility of data for AI development; data quality and challenges in three fields (pandemic response, human language technologies for under-resourced languages, and AI applications in the criminal justice system);
  • and recommendations on where GPAI could enhance international collaboration on data governance.

The report was prepared for GPAI's first summit in Montréal in December 2020 and presented along with the following related documents from the Data Governance WG:


This report was commissioned by experts of the Global Partnership on Artificial Intelligence's Working Group on Data Governance. The report does not necessarily reflect the views of the experts' organizations, GPAI, the OECD or their respective members.


Role of Data Report final version 07122020.pdf
