3490460
doi
10.5281/zenodo.3490460
oai:zenodo.org:3490460
l-sized Training and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study"
Tobias Weber
Leibniz Supercomputing Centre
doi:10.5281/zenodo.3490329
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
research data
disciplines of research
supervised machine learning
multi-label classification
text processing
data science
<p>Automated classification of research data metadata by discipline(s) of research can be used in scientometric research, by repository service providers, and in research data aggregation services. Openly available metadata from the DataCite index for research data were used to compile a large training and evaluation set comprising 609,524 records. This is the cleaned and vectorized version with a large feature selection.</p>
Zenodo
2019-10-15
info:eu-repo/semantics/other
3490459
1587342848.351904
2005185530
md5:2d0dacc2e0902b6ca69e1b997fb6da51
https://zenodo.org/records/3490460/files/l_data_vectorized.tar.gz
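The checksum and file URL listed above can be used to check a downloaded copy of the archive before unpacking it. The following is a minimal sketch, not part of the original record: the helper names `md5_of_file` and `matches_expected` are hypothetical, the local file name is assumed to match the last path segment of the URL, and reading the value `2005185530` as the file size in bytes is an assumption.

```python
import hashlib
import os

# Values copied from this record. Treating 2005185530 as the size in bytes
# is an assumption; the record does not label the field.
EXPECTED = "md5:2d0dacc2e0902b6ca69e1b997fb6da51"
ASSUMED_SIZE_BYTES = 2005185530
ARCHIVE = "l_data_vectorized.tar.gz"  # assumed local file name


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED):
    """Compare a file's MD5 digest with a value in 'md5:<hex>' form."""
    algorithm, _, hex_digest = expected.partition(":")
    if algorithm != "md5":
        raise ValueError("unsupported checksum algorithm: " + algorithm)
    return md5_of_file(path) == hex_digest.lower()


# Only runs the check if the archive has already been downloaded.
if os.path.exists(ARCHIVE):
    print("size matches:", os.path.getsize(ARCHIVE) == ASSUMED_SIZE_BYTES)
    print("checksum matches:", matches_expected(ARCHIVE))
```

Reading in chunks matters here because the archive is large; hashing it in one `read()` call would hold the whole file in memory.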
public
10.5281/zenodo.3490329
Compiles
doi
10.5281/zenodo.3490459
isVersionOf
doi