Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published August 17, 2020 | Version v1
Conference paper Open

Context-Driven Discoverability of Research Data

  • 1. CNR-ISTI, Pisa, Italy

Description

Research data sharing has been proved to be key for accelerating scientific progress and fostering interdisciplinary research; hence, the ability to search, discover and reuse data items is nowadays vital in doing science. However, research data discovery is yet an open challenge. In many cases, descriptive metadata exhibit poor quality, and the ability to automatically enrich metadata with semantic information is limited by the data files format, which is typically not textual and hard to mine. More generally, however, researchers would like to find data used across different research experiments or even disciplines. Such needs are not met by traditional metadata description schemata, which are designed to freeze research data features at deposition time.

In this paper, we propose a methodology that enables “context-driven discovery” for research data thanks to their proven usage across research activities that might differ from the original one, potentially across diverse disciplines. The methodology exploits the collection of publication–dataset and dataset–dataset links provided by OpenAIRE Scholexplorer data citation index so to propagate articles metadata into related research datasets by leveraging semantic relatedness. Such “context propagation” process enables the construction of “context-enriched” metadata of datasets, which enables “context-driven” discoverability of research data. To this end, we provide a real-case evaluation of this technique applied to Scholexplorer. Due to the broad coverage of Scholexplorer, the evaluation documents the effectiveness of this technique at improving data discovery on a variety of research data repositories and databases.

Notes

Please cite this paper as: Baglioni, M., Manghi, P. and Mannocci, A., "Context-Driven Discoverability of Research Data", In proceedings of the 24th International Conference on Theory and Practice of Digital Libraries (TPDL), Lyon, France, 2020. doi: 10.1007/978-3-030-54956-5_15

Files

Context_driven_Discoverability_of_Research_Data___Revised.pdf

Files (358.2 kB)

Additional details

Funding

OpenAIRE-Advance – OpenAIRE Advancing Open Scholarship 777541
European Commission