Dataset Open Access
List of citations from the English Wikipedia articles extracted from the enwiki-20170720-pages-articles XML dump via https://pypi.org/project/mwcites/ , DOIs cleaned with custom regular expressions.
The list of scholarly publications identified by the DOIs has been filtered to exclude those which are already available in Open Access and those which may not be depositable according to SHERPA/RoMEO policy summaries, first via the Dissemin API and then by the oaDOI API, with the attached Python script (https://github.com/nemobis/bots/blob/master/doi-doai-openaccess.py ).
This produced a list of 194913 DOIs available in open access and 430230 DOIs unavailable and depositable (as of 2017-08-22 data, which for oaDOI was partly v1 and partly v2).
Name | Size | |
---|---|---|
COPYING
md5:7acfe7ceb7a177e3b44e7c671a17836f |
60 Bytes | Download |
doi-doai-openaccess.py
md5:d8fccb4bb7d99cafd4c946ceed6ab30a |
8.4 kB | Download |
enwiki-20170720-available-DOIs.txt.xz
md5:5f0765c36bc3e3cfc8e740e1881091f6 |
547.0 kB | Download |
enwiki-20170720-depositable-DOIs.txt.xz
md5:fa54610cd5803e8c89569121f3a73b6a |
1.2 MB | Download |
enwiki-20170720-depositable-oadoi.log.xz
md5:7ec4d72e09900cf217d401adb881c76f |
2.5 MB | Download |
enwiki-20170720-depositable.log.xz
md5:8916c753807014f857df8c949f934135 |
2.6 MB | Download |
enwiki-20170720-pages-articles-citations.tsv.xz
md5:a4be65b54d464d7b7f875c803d495dbe |
37.6 MB | Download |
mwcites-dois.sed
md5:00c03c4fcdca15ffa15ab728d01cf38c |
350 Bytes | Download |
Piwowar H, Priem J, Larivière V, Alperin JP, Matthias L, Norlander B, Farley A, West J, Haustein S. 2018. The state of OA: a large-scale analysis of the prevalence and impact of Open Access articles. PeerJ 6:e4375 https://doi.org/10.7717/peerj.4375
All versions | This version | |
---|---|---|
Views | 737 | 738 |
Downloads | 212 | 212 |
Data volume | 1.2 GB | 1.2 GB |
Unique views | 675 | 676 |
Unique downloads | 121 | 121 |