10.5281/zenodo.3841797
https://zenodo.org/records/3841797
oai:zenodo.org:3841797
Tobias Weber
0000-0003-1815-7041
Leibniz Supercomputing Centre
Michael Fromm
0000-0002-7244-4191
Database Systems Group, Ludwig-Maximilians-Universität München
Nelson Tavares de Sousa
0000-0003-1866-7156
Software Engineering Group, Kiel University
Statistics and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study"
Zenodo
2019
supervised machine learning
multi-label classification
research data
text processing
data science
disciplines of research
2019-10-15
10.5281/zenodo.3490467
Creative Commons Attribution 4.0 International
Automated classification of research data metadata by discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata from the DataCite index for research data were used to compile a large training and evaluation set comprising 609,524 records. This publication contains the aggregated data for the paper, along with the evaluation data of all model/hyper-parameter training and test runs.