10.5281/zenodo.1456175
https://zenodo.org/records/1456175
oai:zenodo.org:1456175
La Bruzzo, Sandro
Sandro
La Bruzzo
0000-0003-2855-1245
Institute of Information Science and Technology - CNR
Manghi, Paolo
Paolo
Manghi
0000-0001-7291-3210
Institute of Information Science and Technology - CNR
Mannocci, Andrea
Andrea
Mannocci
0000-0002-5193-7851
Knowledge Media Institute - Open University
OpenAIRE's DOIBoost - Boosting CrossRef for Research
Zenodo
2018
data paper
dataset
ORCID
Microsoft Academic Graph
Unpaywall
CrossRef
reproducible science
metadata
aggregation
2018-10-01
eng
10.5281/zenodo.1441058
10.5281/zenodo.1438356
10.5281/zenodo.1441071
https://zenodo.org/communities/ircdl
https://zenodo.org/communities/openaire
https://zenodo.org/communities/eu
2.0
Creative Commons Attribution 4.0 International
Research in information science and scholarly communication strongly relies on the availability of openly accessible datasets of metadata about scholarly entities and, where possible, their related payloads. Since such metadata is scattered across diverse, freely accessible online resources (e.g. CrossRef, ORCID), researchers in this domain must struggle with metadata integration problems and end up producing custom datasets of undocumented and rather obscure provenance. This practice wastes time, duplicates effort, and typically violates the open science best practices of transparency and reproducibility. In this article, we describe how to generate DOIBoost, a metadata collection that enriches CrossRef (May 2018) with inputs from Microsoft Academic Graph (May 2018), ORCID (Dec 2017), and Unpaywall (Dec 2017) in order to support high-quality and robust research experiments, save researchers time, and make those experiments comparable. To this end, we describe the value of the dataset and its schema, analyse its actual content, and share the software toolkit and experimental workflow required to reproduce it. The DOIBoost dataset and software toolkit are made openly available via Zenodo.org. DOIBoost will become an input source to the OpenAIRE information graph.
This is the pre-print of a data paper submitted to the IRCDL 2019 conference: http://ircdl2019.isti.cnr.it
European Commission
10.13039/501100000780
777541
OpenAIRE Advancing Open Scholarship