Published March 16, 2022 | Version 6.0
Dataset Open

OpenAIRE ScholeXplorer Service: Scholix JSON Dump

  • 1. ISTI - CNR

Description

This dataset contains the GZ-compressed dump of the Scholix links (schema Version 4) exposed by the OpenAIRE ScholeXplorer service. It consists of 417+ Mi bi-directional links (i.e. 975+ Mi directed links) of two kinds, literature-dataset and dataset-dataset, involving 24+ Mi literature objects and 37+ Mi datasets, an increase of around 160 Mi links with respect to the previous release. Links are collected from publishers (CrossRef, EventData), data centers (DataCite and data centers), institutional/thematic repositories (OpenAIRE), and life-science databases (EMBL-EBI), or inferred by OpenAIRE by text-mining the PDFs of around 14 Mi publications. The dataset is split into 30 compressed files, each of at most ~10 GB, for a total of ~328 GB.

Note that the dataset conforms to a new version of the schema (schema Version 4). The changes are minor and backward compatible; they concern optional fields and extensions of vocabularies. The readme.doc file describes the schema changes and provides statistics about the dataset.
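As a rough illustration of how one of the GZ-compressed files might be read without unpacking it to disk, the sketch below streams records one at a time. It assumes that each file holds newline-delimited JSON (one Scholix link per line); this page does not state the internal layout, so check readme.doc first. The names `iter_links` and `count_links` are hypothetical helpers, not part of any OpenAIRE tooling.

```python
import gzip
import json
from typing import Iterator, Optional


def iter_links(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a gzip file.

    Assumption: the dump is newline-delimited JSON. If the files use a
    different layout (e.g. a single JSON array), this needs adapting.
    """
    # Text mode decompresses on the fly, so the file is never fully
    # loaded into memory.
    with gzip.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def count_links(path: str, limit: Optional[int] = None) -> int:
    """Count records in one file, stopping early once `limit` is reached."""
    count = 0
    for _ in iter_links(path):
        count += 1
        if limit is not None and count >= limit:
            break
    return count
```

Streaming keeps memory use flat regardless of file size, which matters for files of roughly 10 GB each; passing a small `limit` gives a quick sanity check on a freshly downloaded part.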

Notes

When citing this dataset, please also cite the related publication: "Uwe Schindler, Hylke Koers, Sandro La Bruzzo, Adrian Burton, Michael Diepenbroek, Amir Aryani, & Paolo Manghi. (2020). The data-literature interlinking service: Towards a common infrastructure for sharing data-article links. Program, Emerald Publisher, 51 (0033–0337), 75–100. DOI: 10.1108/prog-06-2016-0048"

Files (352.8 GB)

Checksum Size
md5:ed7e5eed5d88e6a207856e1b2f302f64 54.8 kB
md5:aedf991dc853d32177fc8d493a544c20 10.8 GB
md5:192b459659ed45a38ff2885e541b7c31 10.8 GB
md5:190a35705855fd84e23e6f57cdaaa82a 10.8 GB
md5:e61599c8954c773eaf20ccf4286ae491 10.8 GB
md5:aa1bbf5cfb05f43ba7f6473818e1a759 10.8 GB
md5:5461a09562daa0741f8e66ce4ecc0a47 10.8 GB
md5:f485fd460bf1f094d780e56f4b104b5c 10.8 GB
md5:23704b2452e7d83290a32879fbae4a77 10.8 GB
md5:48aa506103ff610d947cbc01a86dfb13 10.8 GB
md5:c736d873d95cad18821a753b298e1f08 10.8 GB
md5:354de314ee427975768edcc78c3ab11e 10.8 GB
md5:5f86de6093c03e674fb0bc7da78f53c0 10.8 GB
md5:8824ddc90cabdc16e0f08016732eeed0 10.8 GB
md5:b5c1fcd3b0eea208204f4ad69d5244f6 10.8 GB
md5:7460c4bd662d3e2e36e14c157303a580 10.8 GB
md5:c2cc491ec0e8f29b256e4aa7c6962371 10.8 GB
md5:7ace039b9e4c8d2a070fcd674b5379f4 10.8 GB
md5:4908c321f027c021aab98ca33b74597d 10.8 GB
md5:f0208b953e2bbce8df524f7502a4c391 10.8 GB
md5:0ceeb5179f51c5441c7aee31d6584cb4 10.8 GB
md5:13bf3fbcc2451e45f7043813edd8c92b 10.8 GB
md5:b69b911fd246ed4c8f04e405b61e37da 10.8 GB
md5:11445481f38ba301339389ad8802c8c6 10.8 GB
md5:1ea9ac3ee972da42142045602c82257a 10.8 GB
md5:10840030cb38c0a39322add71f2928a3 10.8 GB
md5:269bff33c9bf106bdc8289547e92eeb9 10.8 GB
md5:6bc98130fd765c324d29d580a1372a72 8.5 GB
md5:8d0ff73e862f1731eee78bdbf5c0300d 10.8 GB
md5:5c10a8f345bc86dadff6e7924711308c 10.8 GB
md5:5e85eaff20ece85dbd715631212fffd9 10.8 GB
md5:29f0965fc163ccd6b2e7d5d503ef51bf 10.8 GB
md5:00333d010633dc4f38c1150568ee067b 10.8 GB
md5:8fe2df548c8c00fb582d0b0f849df27b 10.8 GB
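
Because each file is listed with an MD5 checksum, a download can be verified before use. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `matches_listing` are hypothetical, and the local file path is whatever name the file was saved under (file names are not shown in this listing).

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads avoid holding a ~10 GB file in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file against a listed value such as 'md5:ed7e5e...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listing("part.gz", "md5:aedf991dc853d32177fc8d493a544c20")` returns `True` only if the local copy is byte-identical to the published file; `"part.gz"` is a placeholder name.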

Additional details

Related works

Is documented by
Dataset: 10.5281/zenodo.6351557 (DOI)

Funding

OpenAIRE-Advance – OpenAIRE Advancing Open Scholarship 777541
European Commission
OpenAIRE Nexus – OpenAIRE-Nexus Scholarly Communication Services for EOSC users 101017452
European Commission
OpenAIRE2020 – Open Access Infrastructure for Research in Europe 2020 643410
European Commission