There is a newer version of the record available.

Published October 14, 2021 | Version v1
Dataset Open

OC-782K: Knowledge Graph of "Scientometrics" modelled according to the OpenCitations Data Model

Authors/Creators

Description

This dataset is a knowledge graph extracted from a triplestore covering information about the journal Scientometrics and modelled according to the OpenCitations Data Model. The original triplestore is available here. This KG was extracted for a research project on knowledge graph embeddings (KGEs) for author disambiguation. Structural triples of the knowledge graph are split into training, testing and validation for applying representation learning methods. Textual literals and numeric literals were stored separately in order to implement multimodal approaches for KGEs (see arXiv:1802.00934). For the same reason, textual literals and numeric literals are already stored into sentence embeddings and a numeric matrix respectively in the files textual_literals.npy and numeric_literals.npy. For the script used to gather this dataset see the GitHub repository: https://github.com/sntcristian/and-kge/tree/main/open-citations.

Files

dataset_statistics.json

Files (1.0 GB)

Name Size Download all
md5:71fa8025f828803f111d7f09ba93a3fe
449 Bytes Preview Download
md5:dcd4c5b2dfd06a98563153f64d07e83e
18.8 MB Preview Download
md5:d4c371f7c5969c4d9750b3bc8d821c12
1.2 MB Download
md5:f5d9338832ae1581485aa5fb713809de
6.8 MB Preview Download
md5:5155e0d66879717d74066a6d51bf2f0b
17.3 MB Preview Download
md5:855669b2ace77914f229cfdb40bf1930
900.7 MB Download
md5:9569339946d814c37659289912aa8701
14.4 MB Preview Download
md5:3bffa2611641ab032dda4540d40b29b1
55.6 MB Preview Download
md5:bb75b9668301c5b50d22fee09f35db47
13.8 MB Preview Download