There is a newer version of the record available.

Published July 28, 2023 | Version 2023-07-28
Dataset | Open

COKI Open Access Dataset

  • 1. Centre for Culture and Technology, Curtin University
  • 2. Curtin Institute for Computation, Curtin University

Description

The COKI Open Access Dataset measures open access performance for 221 countries and 14,477 institutions and is available in JSON Lines format. The data is visualised at the COKI Open Access Dashboard: https://open.coki.ac/.

The COKI Open Access Dataset is created with the COKI Academic Observatory data collection pipeline, which fetches data about research publications from multiple sources, synthesises the datasets and creates the open access calculations for each country and institution.

Each week a number of specialised research publication datasets are collected. The datasets that are used for the COKI Open Access Dataset release include Crossref Metadata, OpenAlex, Unpaywall and the Research Organization Registry.

Once fetched, the datasets are synthesised to produce aggregate time series statistics for each country and institution. These statistics include publication count, open access status and citation count.
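The aggregation step described above can be illustrated with a minimal sketch. It is not the COKI pipeline's code: the input records, the field names (country, year, is_oa, citations) and the output keys are all assumptions chosen for illustration.

```python
from collections import defaultdict

# Hypothetical input: one record per publication. These field names are
# illustrative assumptions, not the schema used by the COKI pipeline.
publications = [
    {"country": "AUS", "year": 2020, "is_oa": True, "citations": 4},
    {"country": "AUS", "year": 2020, "is_oa": False, "citations": 1},
    {"country": "AUS", "year": 2021, "is_oa": True, "citations": 0},
    {"country": "NZL", "year": 2020, "is_oa": True, "citations": 7},
]


def aggregate(records):
    """Group publications by (country, year) and total three statistics."""
    totals = defaultdict(
        lambda: {"n_outputs": 0, "n_outputs_open": 0, "n_citations": 0}
    )
    for rec in records:
        stats = totals[(rec["country"], rec["year"])]
        stats["n_outputs"] += 1                       # publication count
        stats["n_outputs_open"] += int(rec["is_oa"])  # open access count
        stats["n_citations"] += rec["citations"]      # citation count
    return dict(totals)


time_series = aggregate(publications)
for (country, year), stats in sorted(time_series.items()):
    pct_open = 100 * stats["n_outputs_open"] / stats["n_outputs"]
    print(country, year, stats, f"{pct_open:.1f}% open")
```

The same grouping could be applied per institution by swapping the grouping key.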

See https://open.coki.ac/data/ for the dataset schema. A new version of the dataset is deposited every week.
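Because the dataset is distributed as JSON Lines inside a zip archive, it can be read one record at a time without unpacking it first. The following is a minimal sketch using only the Python standard library. The ".jsonl" member suffix, the helper name iter_jsonl_from_zip and the demo records are assumptions; consult the schema page for the real field names and inspect the archive for the real member names.

```python
import io
import json
import zipfile


def iter_jsonl_from_zip(zip_source, suffix=".jsonl"):
    """Yield (member_name, record) for each JSON Lines row in the archive.

    The ".jsonl" suffix is an assumption about how members are named.
    """
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if not name.endswith(suffix):
                continue
            with archive.open(name) as raw:
                for line in io.TextIOWrapper(raw, encoding="utf-8"):
                    line = line.strip()
                    if line:  # skip blank lines
                        yield name, json.loads(line)


# Demo on a small in-memory archive with made-up contents, so the sketch
# runs without downloading the real file.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr(
        "example.jsonl",
        '{"id": "AUS", "name": "Australia"}\n\n{"id": "NZL", "name": "New Zealand"}\n',
    )
buffer.seek(0)

rows = list(iter_jsonl_from_zip(buffer))
print(rows)
```

For the downloaded archive, a file path can be passed in place of the in-memory buffer, for example iter_jsonl_from_zip("coki-oa-dataset.zip").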

Code

License
COKI Open Access Dataset © 2022 by Curtin University is licensed under CC BY 4.0.

Attributions
This work contains information from:

Notes

The Curtin Open Knowledge Initiative (COKI) is a strategic initiative of the Research Office at Curtin, the Faculty of Humanities, School of Media, Creative Arts and Social Inquiry and the Curtin Institute for Computation, with additional support from the Mellon Foundation and the Arcadia Fund, a charitable fund of Lisbet Rausing and Peter Baldwin.

Files

coki-oa-dataset.zip (36.3 MB)
md5:09c9f963b55c0103343fccca2177b431
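The MD5 checksum listed for the archive can be used to confirm that a download completed intact. The sketch below assumes the file was saved as coki-oa-dataset.zip in the current directory; the helper names md5_of_file and verify are illustrative.

```python
import hashlib
import os

# Checksum as listed for the archive on this page.
EXPECTED_MD5 = "09c9f963b55c0103343fccca2177b431"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_MD5):
    """Return True when the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


# Assumed local file name; the check is skipped if the file is absent.
archive_path = "coki-oa-dataset.zip"
if os.path.exists(archive_path):
    print("checksum OK" if verify(archive_path) else "checksum mismatch")
```

MD5 is adequate for detecting a corrupted or truncated download, but it is not a safeguard against deliberate tampering.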