Dataset Open Access

Analysis of references in the IPCC AR6 WG2 Report of 2022

Cameron Neylon; Bianca Kramer


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">IPCC, Crossref, references, DOIs</subfield>
  </datafield>
  <controlfield tag="005">20220311014925.0</controlfield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">Archived version of Release 2022-03-10 of GitHub repository: 
https://github.com/Curtin-Open-Knowledge-Initiative/ipcc-ar6</subfield>
  </datafield>
  <controlfield tag="001">6344388</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Utrehct University</subfield>
    <subfield code="0">(orcid)0000-0002-5965-6560</subfield>
    <subfield code="a">Bianca Kramer</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">11256803</subfield>
    <subfield code="z">md5:903dffc03d6000b0a6d6e529487284e0</subfield>
    <subfield code="u">https://zenodo.org/record/6344388/files/ipcc-ar6-0.9.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2022-03-10</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-coki</subfield>
    <subfield code="o">oai:zenodo.org:6344388</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Centre for Culture and Technology, Curtin University</subfield>
    <subfield code="0">(orcid)0000-0002-0068-716X</subfield>
    <subfield code="a">Cameron Neylon</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Analysis of references in the IPCC AR6 WG2 Report of 2022</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-coki</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/publicdomain/</subfield>
    <subfield code="a">Creative Commons Public Domain Dedication and Certification</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This repository contains data on 17,419&amp;nbsp;DOIs cited in the&amp;nbsp;&lt;a href="https://www.ipcc.ch/report/ar6/wg2/"&gt;IPCC Working Group 2 contribution to the Sixth Assessment Report&lt;/a&gt;, and the code to link them to the dataset built at the Curtin Open Knowledge Initiative (COKI).&lt;/p&gt;

&lt;p&gt;References were extracted from the report&amp;#39;s PDFs (downloaded 2022-03-01) via&amp;nbsp;&lt;a href="https://www.scholarcy.com/"&gt;Scholarcy&lt;/a&gt;&amp;nbsp;and exported as RIS and BibTeX files. DOI strings were identified from RIS files by pattern matching and saved as CSV file. The list of DOIs for each chapter and cross chapter paper was processed using a custom Python script to generate a pandas DataFrame which was saved as CSV file and uploaded to Google Big Query.&lt;/p&gt;

&lt;p&gt;We used the main object table of the Academic Observatory, which combines information from Crossref, Unpaywall, Microsoft Academic, Open Citations, the Research Organization Registry and Geonames to enrich the DOIs with bibliographic information, affiliations, and open access status. A custom query was used to join and format the data and the resulting table was visualised in a Google DataStudio dashboard.&lt;br&gt;
&lt;br&gt;
This version of the repository also includes the set of DOIs from references in the &lt;a href="https://www.ipcc.ch/report/ar6/wg1/"&gt;IPCC Working Group 1 contribution to the Sixth Assessment Report&lt;/a&gt;&amp;nbsp;as extracted by Alexis-Michel Mugabushaka and shared on Zenodo: &lt;a href="https://doi.org/10.5281/zenodo.5475442"&gt;https://doi.org/10.5281/zenodo.5475442&lt;/a&gt; (CC-BY)&lt;/p&gt;

&lt;p&gt;A brief descriptive analysis was provided as a &lt;a href="https://openknowledge.community/tracking-climate-change-openaccess/"&gt;blogpost on the COKI website&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The repository contains the following content:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;data/scholarcy/RIS/&lt;/strong&gt;&amp;nbsp;- extracted references as RIS files&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;data/scholarcy/BibTeX/&lt;/strong&gt;&amp;nbsp;- extracted references as BibTeX files&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;IPCC_AR6_WGII_dois.csv&lt;/strong&gt;&amp;nbsp;- list of DOIs&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;data/10.5281_zenodo.5475442/&lt;/strong&gt; - references from IPCC AR6 WG1 report&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Processing:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;preprocessing.R&lt;/strong&gt;&amp;nbsp;- preprocessing steps for identifying and cleaning DOIs&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;process.py&lt;/strong&gt;&amp;nbsp;- Python script for transforming data and linking to COKI data through Google Big Query&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Outcomes:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;a href="https://console.cloud.google.com/bigquery?project=utrecht-university&amp;amp;ws=!1m23!1m3!8m2!1s145441926252!2sd59dfac7972a45f8a2f5ee4ac866c34d!1m4!4m3!1sacademic-observatory!2sobservatory!3sdoi20220226!1m4!4m3!1sutrecht-university!2sipcc_ar6!3sdoi_table!1m3!3m2!1sutrecht-university!2sipcc_ar6!1m4!4m3!1sutrecht-university!2sipcc_ar6!3sipcc_ar6_dois&amp;amp;d=ipcc_ar6&amp;amp;p=utrecht-university&amp;amp;page=table&amp;amp;t=doi_table&amp;amp;pli=1&amp;amp;authuser=1"&gt;Dataset on BigQuery&lt;/a&gt;&amp;nbsp;- requires a google account for access and bigquery account for querying&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://datastudio.google.com/s/vZN2zLr9wS4"&gt;Data Studio Dashboard&lt;/a&gt;&amp;nbsp;- interactive analysis of the generated data&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://www.zotero.org/groups/4614109"&gt;Zotero library&lt;/a&gt; of references extracted via Scholarcy&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;PDF version of blogpost&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Note on licenses:&lt;/strong&gt;&amp;nbsp;&lt;br&gt;
Data are made available under&amp;nbsp;&lt;a href="https://creativecommons.org/publicdomain/zero/1.0/"&gt;CC0&lt;/a&gt;&amp;nbsp;(with the exception of WG1 reference data, which have been shared under &lt;a href="https://creativecommons.org/licenses/by/4.0/legalcode"&gt;CC-BY 4.0&lt;/a&gt;)&lt;br&gt;
Code is made available under&amp;nbsp;&lt;a href="http://www.apache.org/licenses/"&gt;Apache License 2.0&lt;/a&gt;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">url</subfield>
    <subfield code="i">isIdenticalTo</subfield>
    <subfield code="a">https://github.com/Curtin-Open-Knowledge-Initiative/ipcc-ar6/releases/tag/v0.8</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">references</subfield>
    <subfield code="a">10.5281/zenodo.5475442</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.6327206</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.6344388</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
802
42
views
downloads
All versions This version
Views 802279
Downloads 4216
Data volume 369.1 MB180.1 MB
Unique views 709254
Unique downloads 3816

Share

Cite as