Dataset Open Access

Statistics and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research"

Tobias Weber; Michael Fromm; Nelson Tavares de Sousa

Automated classification of metadata of research data by their discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata of the DataCite index for research data were used to compile a large training and evaluation set comprised of 609,524 records. This publication contains aggregated data for the paper. It also contains the evaluation data of all model/hyper-parameter training and test runs.

Files (91.4 kB)
Name Size
paper_data.tar.gz
md5:e93b229739ae7f646dbeed16233cdc9b
91.4 kB Download
5,250
2,620
views
downloads
All versions This version
Views 5,2505,255
Downloads 2,6202,622
Data volume 239.5 MB239.7 MB
Unique views 3,9753,980
Unique downloads 2,3392,341

Share

Cite as