COKI Open Access Dataset
Creators
- Centre for Culture and Technology, Curtin University
- Curtin Institute for Data Science, Curtin University
Description
The COKI Open Access Dataset measures open access performance for 225 countries and 50,000 institutions and is available in JSON Lines format. The data is visualised at the COKI Open Access Dashboard: https://open.coki.ac/.
The COKI Open Access Dataset is created with the COKI Academic Observatory data collection pipeline, which fetches data about research publications from multiple sources, synthesises the datasets and creates the open access calculations for each country and institution.
Each week a number of specialised research publication datasets are collected. The datasets that are used for the COKI Open Access Dataset release include Crossref Metadata, OpenAlex, Unpaywall and the Research Organization Registry.
Once fetched, the datasets are synthesised to produce aggregate time series statistics for each country and institution. These statistics include publication count, open access status and citation count.
See https://open.coki.ac/data/ for the dataset schema. A new version of the dataset is deposited every week.
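Because the dataset is distributed as JSON Lines inside a zip archive, it can be read one record at a time without unpacking the archive first. The following is a minimal sketch, not part of the COKI tooling: the function name `iter_jsonl_records` is hypothetical, and the `.jsonl` member suffix is an assumption about how the files inside the archive are named, so adjust it after inspecting the archive contents.

```python
import io
import json
import zipfile
from typing import Any, Dict, Iterator


def iter_jsonl_records(zip_path: str, suffix: str = ".jsonl") -> Iterator[Dict[str, Any]]:
    """Yield one dict per non-empty line from each archive member ending in `suffix`.

    The default suffix is an assumption, not something stated by the dataset.
    """
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if not name.endswith(suffix):
                continue  # skip members that are not JSON Lines files
            with archive.open(name) as raw:
                # Decode the binary stream as UTF-8 text and parse line by line.
                for line in io.TextIOWrapper(raw, encoding="utf-8"):
                    line = line.strip()
                    if line:
                        yield json.loads(line)
```

Calling `next(iter_jsonl_records("coki-oa-dataset.zip"))` would return the first record as a dictionary. The sketch does not assume any field names; the keys to expect are documented in the schema at https://open.coki.ac/data/.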
Code
- The COKI Academic Observatory data collection pipeline is used to create the dataset.
- The COKI OA Website GitHub project contains the code for the web app that visualises the dataset at open.coki.ac. The code is also archived on Zenodo.
License
COKI Open Access Dataset © 2022 by Curtin University is licensed under CC BY 4.0.
Attributions
This work contains information from:
- OpenAlex, which is made available under the CC0 licence.
- Crossref Metadata via the Metadata Plus program. Bibliographic metadata is made available without copyright restriction, and Crossref-generated data is made available under a CC0 licence. See metadata licence information for more details.
- Unpaywall. The Unpaywall Data Feed is used under licence. Data is freely available from Unpaywall via the API, data dumps and as a data feed.
- Research Organization Registry, which is made available under a CC0 licence.
Files
Name | Size | Checksum |
---|---|---|
coki-oa-dataset.zip | 129.9 MB | md5:992117365be98c79b7961ed510236b1b |
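The MD5 checksum listed for the archive can be used to confirm that a download completed without corruption. A minimal sketch using only the Python standard library follows; the helper names `md5_of_file` and `matches_checksum` are hypothetical and not part of any COKI tooling.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected: str) -> bool:
    """Compare a file's MD5 with an expected value given as '<hex>' or 'md5:<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum("coki-oa-dataset.zip", "md5:992117365be98c79b7961ed510236b1b")` should return `True` for an intact copy of the release listed above. Since a new version is deposited every week, other releases will have different checksums.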