There is a newer version of this record available.

Dataset Open Access

DOIBoost Dataset Dump

La Bruzzo, Sandro; Manghi, Paolo; Mannocci, Andrea


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>La Bruzzo, Sandro</dc:creator>
  <dc:creator>Manghi, Paolo</dc:creator>
  <dc:creator>Mannocci, Andrea</dc:creator>
  <dc:date>2018-09-28</dc:date>
  <dc:description> 

Research in information science and scholarly communication strongly relies on the availability of openly accessible datasets of metadata and, where possible, their relative payloads. To this end, CrossRef plays a pivotal role by providing free access to its entire metadata collection, and allowing other initiatives to link and enrich its information. Therefore, a number of key pieces of information result scattered across diverse datasets and resources freely available online. As a result of this fragmentation, researchers in this domain end up struggling with daily integration problems producing a plethora of ad-hoc datasets, therefore incurring in a waste of time, resources, and infringing open science best practices. DOIBoost is a metadata collection that enriches CrossRef with inputs from Microsoft Academic Graph, ORCID, and Unpaywall for the purpose of supporting high-quality and robust research experiments, saving times to researchers and enabling their comparison.

This entry consists of two files: doiBoost.tar.gz (which contains a set of part.gz files, each one containing the JSON files realtive to the enriched CrossRef records) and termsOfUse.doc (which contains details on the terms of use of DOIBoost).

Note that this records comes with two relationships to other results of this experiment: 


	link to the data paper: for more information on how the dataset is (and can be) generated;
	link to the software: to repeat the experiment  .


 </dc:description>
  <dc:identifier>https://zenodo.org/record/1438356</dc:identifier>
  <dc:identifier>10.5281/zenodo.1438356</dc:identifier>
  <dc:identifier>oai:zenodo.org:1438356</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>info:eu-repo/grantAgreement/EC/H2020/777541/</dc:relation>
  <dc:relation>doi:10.5281/zenodo.1441058</dc:relation>
  <dc:relation>doi:10.5281/zenodo.1441072</dc:relation>
  <dc:relation>doi:10.5281/zenodo.1438355</dc:relation>
  <dc:relation>url:https://zenodo.org/communities/openaire</dc:relation>
  <dc:relation>url:https://zenodo.org/communities/openaire-research-graph</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>dataset</dc:subject>
  <dc:subject>CrossRef</dc:subject>
  <dc:subject>Microsoft Academic Graph</dc:subject>
  <dc:subject>Unpaywall</dc:subject>
  <dc:subject>Spark</dc:subject>
  <dc:subject>aggregation</dc:subject>
  <dc:subject>metadata</dc:subject>
  <dc:subject>enrichment</dc:subject>
  <dc:subject>ORCID</dc:subject>
  <dc:title>DOIBoost Dataset Dump</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
2,444
3,563
views
downloads
All versions This version
Views 2,4441,000
Downloads 3,563617
Data volume 172.3 TB21.7 TB
Unique views 2,073835
Unique downloads 735220

Share

Cite as