There is a newer version of the record available.

Published March 30, 2022 | Version 2022-03-30
Dataset Open

COKI Open Access Dataset

  • 1. Centre for Culture and Technology, Curtin University
  • 2. Curtin Institute for Computation, Curtin University

Description

The COKI Open Access Dataset measures open access performance for 142 countries and 5117 institutions and is available in JSON Lines format. The data is visualised at the COKI Open Access Dashboard: https://open.coki.ac/.
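Since the dataset is distributed as JSON Lines (one JSON object per line), it can be read with a few lines of standard-library Python. The following is a minimal sketch only: the helper name `read_json_lines` and the sample field names (`id`, `n_outputs`) are illustrative assumptions, not the published schema, which is documented at https://open.coki.ac/data/.

```python
import io
import json


def read_json_lines(stream):
    """Yield one parsed object per non-empty line of a JSON Lines stream."""
    for line in stream:
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


# Hypothetical sample records; the field names are illustrative only
# and are not taken from the real dataset schema.
sample = io.StringIO(
    '{"id": "AUS", "n_outputs": 10}\n'
    '{"id": "NZL", "n_outputs": 4}\n'
)
records = list(read_json_lines(sample))
```

For a real file, the same generator can be passed a handle opened with `open(path, encoding="utf-8")`, which keeps memory use low because records are parsed one line at a time.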

The COKI Open Access Dataset is created with the COKI Academic Observatory data collection pipeline, which fetches data about research publications from multiple sources, synthesises the datasets and creates the open access calculations for each country and institution.

Each week a number of specialised research publication datasets are collected. The datasets that are used for the COKI Open Access Dataset release include Crossref Metadata, Microsoft Academic Graph, Unpaywall and the Research Organization Registry.

Once fetched, the datasets are synthesised to produce aggregate time series statistics for each country and institution in the dataset. These statistics include publication count, open access status and citation count.
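To illustrate what an aggregate time series of this kind might look like, the sketch below groups per-publication records by entity and year and derives a publication count, an open access count and percentage, and a citation count. It is not the COKI Academic Observatory pipeline: the function `aggregate`, the input fields (`entity`, `year`, `is_oa`, `citations`) and the output field names are all assumptions made for this example.

```python
from collections import defaultdict


def aggregate(publications):
    """Group publications by (entity, year) and compute summary statistics.

    All field names here are hypothetical and chosen for illustration.
    """
    stats = defaultdict(
        lambda: {"n_outputs": 0, "n_outputs_open": 0, "n_citations": 0}
    )
    for pub in publications:
        row = stats[(pub["entity"], pub["year"])]
        row["n_outputs"] += 1
        row["n_outputs_open"] += 1 if pub["is_oa"] else 0
        row["n_citations"] += pub["citations"]

    # Add the percentage of open access outputs for each (entity, year).
    result = {}
    for key, row in stats.items():
        percent_open = 100.0 * row["n_outputs_open"] / row["n_outputs"]
        result[key] = {**row, "p_outputs_open": percent_open}
    return result


# Made-up input records, not real data.
publications = [
    {"entity": "AUS", "year": 2020, "is_oa": True, "citations": 3},
    {"entity": "AUS", "year": 2020, "is_oa": False, "citations": 1},
    {"entity": "AUS", "year": 2021, "is_oa": True, "citations": 0},
]
series = aggregate(publications)
```

Keying the result on `(entity, year)` means the same function would serve countries and institutions alike, provided each publication has already been matched to the entity it counts towards.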

See https://open.coki.ac/data/ for the dataset schema. A new version of the dataset is deposited every week.

Code

License
COKI Open Access Dataset © 2022 by Curtin University is licensed under CC BY 4.0.

Attributions
This work contains information from:

Notes

The Curtin Open Knowledge Initiative (COKI) is a strategic initiative of the Research Office at Curtin, the Faculty of Humanities, School of Media, Creative Arts and Social Inquiry and the Curtin Institute for Computation, with additional support from the Andrew W. Mellon Foundation and the Arcadia Fund, a charitable fund of Lisbet Rausing and Peter Baldwin.

Files

Files (10.1 MB)

Name: coki-oa-dataset.zip
Size: 10.1 MB
Checksum: md5:11c9fb5b6ff58b1e96027b4fa09be181