Dataset Open Access
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">research data</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">disciplines of research</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">supervised machine learning</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">multi-label classification</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">text processing</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">data science</subfield> </datafield> <controlfield tag="005">20200420003408.0</controlfield> <controlfield tag="001">3490460</controlfield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">2005185530</subfield> <subfield code="z">md5:2d0dacc2e0902b6ca69e1b997fb6da51</subfield> <subfield code="u">https://zenodo.org/record/3490460/files/l_data_vectorized.tar.gz</subfield> </datafield> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2019-10-15</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="p">openaire_data</subfield> <subfield code="o">oai:zenodo.org:3490460</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="u">Leibniz Supercomputing Centre</subfield> <subfield code="0">(orcid)0000-0003-1815-7041</subfield> <subfield code="a">Tobias Weber</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">l-sized Training and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study"</subfield> </datafield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution 4.0 International</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="650" ind1="1" ind2=" "> <subfield code="a">004 Data processing & computer science</subfield> <subfield code="0">(url)https://dewey.info/</subfield> </datafield> <datafield tag="650" ind1="1" ind2=" "> <subfield code="a">020 Library & information sciences</subfield> <subfield code="0">(url)https://dewey.info/</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p>Automated classification of metadata of research data by their discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata of the DataCite index for research data were used to compile a large training and evaluation set comprised of 609,524 records. This is the cleaned and vectorized version with a feature selection of large size.</p></subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">doi</subfield> <subfield code="i">compiles</subfield> <subfield code="a">10.5281/zenodo.3490329</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">doi</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="a">10.5281/zenodo.3490459</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.5281/zenodo.3490460</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> </record>
All versions | This version | |
---|---|---|
Views | 79 | 79 |
Downloads | 25 | 25 |
Data volume | 50.1 GB | 50.1 GB |
Unique views | 73 | 73 |
Unique downloads | 18 | 18 |