Dataset Open Access

An open dataset of IPCC reports' references (6th Assessment Cycle) – Version 1

Alexis-Michel Mugabushaka

We provide a first version of an open dataset of publications cited by the IPCC reports of the 6th Assessment Cycle (Reports — IPCC)

The lists were extracted from the reference sections of three special reports and the first assessment report (see Figure 1). The data are presented in two formats: on the one hand, text files with a list of references for each section of the reports (generally each chapter), and on the other hand, a structured format (JSON) with identifiers for the documents and the sections, the reference in string format, and the extracted digital object identifiers (DOIs). In this first version, the extracted DOIs are mainly those provided in the references themselves. Table 1 shows the number of references and DOIs for each report.
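As an illustration of what pulling DOIs out of reference strings can look like, here is a minimal sketch. It is not the extraction process used for this dataset, which is not documented in this version. The regular expression, the function name extract_dois, and the example reference are all assumptions made for illustration.

```python
import re
from typing import List

# Assumed pattern: "10.", a 4-9 digit registrant code, "/", then a suffix
# running up to the next whitespace, quote, or angle bracket.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[^\s\"<>]+", re.IGNORECASE)


def extract_dois(reference: str) -> List[str]:
    """Return the distinct DOIs found in one reference string, lowercased."""
    dois: List[str] = []
    for match in DOI_PATTERN.findall(reference):
        # Strip punctuation that usually belongs to the sentence, not the DOI.
        # Caveat: a DOI that genuinely ends in ")" would be truncated here.
        doi = match.rstrip(".,;)").lower()
        if doi not in dois:
            dois.append(doi)
    return dois


# Made-up reference string, for illustration only
example = (
    "Doe, J., 2020: A made-up title. Journal of Examples, 1(2), 3-4, "
    "doi:10.1234/abcd.5678."
)
print(extract_dois(example))
```

A pattern like this only finds DOIs that are printed in the reference; entries cited without a DOI would still need a lookup against a registry such as Crossref.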

For subsequent releases, we plan the following enhancements:

  • Further quality assurance of DOIs. We note that some entries appear in the references of the IPCC reports without DOIs although they are indexed in Crossref. Table 2 shows substantial differences in DOI coverage among reference sections. Spot checks of the dataset suggest that those differences are mainly due to the referencing behaviour of the sections' authors rather than to the type of documents cited. In the next version, we aim to add those missing DOIs and to systematically verify the DOIs included in the references.
  • Expanding the reference lists to reports from past assessment cycles.
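The kind of per-section DOI coverage comparison mentioned in the first item can be sketched with pandas as follows. The column names section_id, reference and doi, and the toy records, are hypothetical; the actual field names should be taken from the schema key of the JSON files.

```python
import pandas as pd

# Toy records for illustration; column names are hypothetical.
records = pd.DataFrame(
    {
        "section_id": ["ch1", "ch1", "ch2", "ch2", "ch2"],
        "reference": ["ref a", "ref b", "ref c", "ref d", "ref e"],
        "doi": ["10.1234/a", None, "10.1234/c", "10.1234/d", None],
    }
)


def doi_coverage(df, section_col="section_id", doi_col="doi"):
    """Count references and references with a DOI for each section."""
    # Treat missing values and empty strings as "no DOI".
    has_doi = df[doi_col].fillna("").astype(str).str.strip().ne("")
    flagged = df.assign(has_doi=has_doi)
    summary = flagged.groupby(section_col)["has_doi"].agg(
        references="size", with_doi="sum"
    )
    summary["share_with_doi"] = summary["with_doi"] / summary["references"]
    return summary


print(doi_coverage(records))
```

Sections with a low share_with_doi would be the natural starting point for completing missing DOIs.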

In addition, we plan to document our dataflow and extraction process in more detail and to demonstrate how it can be used to create open, community-curated datasets of references of other non-scholarly documents.

Each JSON file has two keys: (1) schema, with the structure of the table, and (2) data, with the records.

A simple way to read them into a table is via a pandas DataFrame:

import pandas as pd

df = pd.read_json("file_name.json", orient="table")
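To make the two-key layout concrete, the following sketch writes a tiny stand-in file with pandas' "table" orientation, inspects its top-level keys with the standard json module, and reads it back. The file name sample_references.json and the columns are made up for illustration. The sketch assumes the dataset's files follow the same "table" layout that pandas itself produces.

```python
import json
import tempfile
from pathlib import Path

import pandas as pd

# Tiny stand-in table; the real files have their own columns.
sample = pd.DataFrame(
    {"section_id": ["ch1", "ch2"], "doi": ["10.1234/a", "10.1234/b"]}
)

with tempfile.TemporaryDirectory() as tmp_dir:
    path = Path(tmp_dir) / "sample_references.json"
    sample.to_json(path, orient="table")

    # Look at the raw structure: a "schema" key and a "data" key.
    with open(path, encoding="utf-8") as handle:
        raw = json.load(handle)
    top_level_keys = sorted(raw.keys())
    field_names = [field["name"] for field in raw["schema"]["fields"]]

    # Read the same file back into a DataFrame.
    df = pd.read_json(path, orient="table")

print(top_level_keys)
print(field_names)
print(df)
```

Inspecting the schema key first is a quick way to see which columns and types a file declares before loading all its records.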

 

Acknowledgment

We thank Valentin Hancu (EC/DG ECFIN) for fruitful discussions on the data extraction process.

Disclaimer: 

The views expressed in this paper are the author’s. They do not reflect the views or official positions of the European Commission.

This version also includes a zip file with the references.
Files (7.3 MB)
Name (size, checksum)
Figure_1_IPCC6th_Timeline.jpg (102.7 kB, md5:825c52e8f62c1c64891bf59d3c9abeb6)
IPCC6_references_vers_1.zip (7.1 MB, md5:e437e2e2a1818f11799278b3332de641)
Table_1_overview_references_by_reports.png (16.2 kB, md5:537464c7db66406aa59b452bf06cd3e2)
Table_2_overview_references_by_section.png (122.1 kB, md5:09bd8fc87685b9aef321d9f6b50ee37c)