There is a newer version of the record available.

Published March 16, 2022 | Version 5.0
Dataset Open

OpenAIRE ScholeXplorer Service: Scholix JSON Dump

  • 1. ISTI - CNR

Description

This dataset contains the GZ-compressed dump of the Scholix links (schema Version 4) exposed by the OpenAIRE ScholeXplorer service. The dataset consists of 417+Mi bi-directional links (i.e. 831+Mi directed links) between literature-dataset and dataset-dataset involving 30+ Mi literature objects and 32+ Mi datasets. Links were collected from publishers (CrossRef, EventData), data centers (DataCite and data centers), institutional/thematic repositories (OpenAIRE), life-science databases (EMBL-EBI), and inferred via text-mining by OpenAIRE. The links are organized in 21 compressed files, each of at most ~10 Gb, for a total of ~205GB.

Note that the dataset matches a new version of the schema (schema Version 4). Changes are limited and backward compatible, and regard optional fields and extensions of vocabularies.

The readme.doc file includes changes to the dump schema and statistics about the data.

Notes

When citing this dataset, please cite the related publication also: "Uwe Schindler, Hylke Koers, Sandro La Bruzzo, Adrian Burton, Michael Diepenbroek, Amir Aryani, & Paolo Manghi. (2020). The data-literature interlinking service: Towards a common infrastructure for sharing data-article links. Program, Emerald Publisher, 51 (0033–0337), 75–100. Doi: 10.1108/prog-06-2016-0048"

Files

Files (220.6 GB)

Name Size Download all
md5:e3105c1e9a71f5c2729ecd85764ab684
47.6 kB Download
md5:2b6411c7aa22d42695548ee6663181cf
10.8 GB Download
md5:a2cf7fbf2554d5fb77bd8278171406de
10.8 GB Download
md5:51a138cbd3b04847f21a418006aa76a0
10.8 GB Download
md5:61072d4b7cf1ca1b4fe7cee6d4784c44
10.7 GB Download
md5:53d273f227666092541d1c0ea2facb1b
10.8 GB Download
md5:aa1c62d93749d23f8a19a429347969ab
10.8 GB Download
md5:2156e5398a268f0f8b0620dd1abbd726
10.7 GB Download
md5:50719133dd488d0a8c13f8a6edc228d6
10.8 GB Download
md5:67ae0573dea6c85232722a0a804f90b5
10.7 GB Download
md5:521bf3d8b6d24aaad24a80f5dfd357e7
10.8 GB Download
md5:5a9d49c21d48f0592d9af2fa551fe99b
10.7 GB Download
md5:44e299512ae208d1795a67a4ef9d836a
10.8 GB Download
md5:3755450f6745517e898518002921c144
10.8 GB Download
md5:8a332ebe670bab5e04783fb28d6abc14
5.5 GB Download
md5:f31c9ff875700730b26f6a281b81548e
10.8 GB Download
md5:b3216c292881fc934522aeaa513f65e1
10.7 GB Download
md5:6a39011c0ee69b196967f9a120b287ee
10.8 GB Download
md5:43ffd4964d2df8dbb4c047944d114d08
10.7 GB Download
md5:9f73fbc66a8ccfb9cd4e3e31991d83e4
10.8 GB Download
md5:3e4fa2783e7f44c1811b324c8c28bfaa
10.8 GB Download
md5:551e1c04d8c4f6169fdbe9609e0088f9
10.7 GB Download

Additional details

Related works

Is documented by
Dataset: 10.5281/zenodo.6351557 (DOI)

Funding

OpenAIRE-Advance – OpenAIRE Advancing Open Scholarship 777541
European Commission
OpenAIRE Nexus – OpenAIRE-Nexus Scholarly Communication Services for EOSC users 101017452
European Commission
OpenAIRE2020 – Open Access Infrastructure for Research in Europe 2020 643410
European Commission