Published October 15, 2019 | Version v3
Dataset Open

Statistics and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study"

  • 1. Leibniz Supercomputing Centre
  • 2. Database Systems Group, Ludwig-Maximilians-Universität München
  • 3. Software Engineering Group, Kiel University

Description

Automated classification of metadata of research data by their discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata of the DataCite index for research data were used to compile a large training and evaluation set comprised of 609,524 records. This publication contains aggregated data for the paper. It also contains the evaluation data of all model/hyper-parameter training and test runs.

Files

Files (164.8 kB)

Name Size Download all
md5:ce24f6ab76eb22c05eb3b1f5b3b4477c
164.8 kB Download

Additional details

Subjects

004 Data processing & computer science
https://dewey.info/
020 Library & information sciences
https://dewey.info/