Dataset Open Access
Thoma, Martin
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">symbol</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">LaTeX</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">mathematics</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">pattern recognition</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">machine learning</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">on-line recognition</subfield> </datafield> <controlfield tag="005">20200124192605.0</controlfield> <controlfield tag="001">50022</controlfield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">140790596</subfield> <subfield code="z">md5:2bf1d089ce65c0a39e57064516f1bd1c</subfield> <subfield code="u">https://zenodo.org/record/50022/files/2015-01-28-data.tar</subfield> </datafield> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2015-01-28</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="p">openaire_data</subfield> <subfield code="p">user-computer-vision</subfield> <subfield code="o">oai:zenodo.org:50022</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="u">Karlsruhe Institute of Technology</subfield> <subfield code="a">Thoma, Martin</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">HWRT database of handwritten symbols</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-computer-vision</subfield> </datafield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">http://www.opendatacommons.org/licenses/odbl/1.0/</subfield> <subfield code="a">ODC Open Database License v1.0</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p>The HWRT database of handwritten symbols contains on-line data of handwritten symbols such as all alphanumeric characters, arrows, greek characters and mathematical symbols like the integral symbol.</p> <p>The database can be downloaded in form of bzip2-compressed tar files. Each tar file contains:</p> <ul> <li>symbols.csv: A CSV file with the rows symbol_id, latex, training_samples, test_samples. The symbol id is an integer, the row latex contains the latex code of the symbol, the rows training_samples and test_samples contain integers with the number of labeled data.</li> <li>train-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.</li> <li>test-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.</li> </ul> <p>All CSV files use ";" as delimiter and "'" as quotechar. The data is given in YAML format as a list of lists of dictinaries. Each dictionary has the keys "x", "y" and "time". (x,y) are coordinates and time is the UNIX time.</p> <p> </p> <p>About 90% of the data was made available by Daniel Kirsch via github.com/kirel/detexify-data. Thank you very much, Daniel!</p></subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">url</subfield> <subfield code="i">isSupplementTo</subfield> <subfield code="a">http://www.martin-thoma.de/write-math/data/</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">url</subfield> <subfield code="i">compiles</subfield> <subfield code="a">https://zenodo.org/record/259444</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.5281/zenodo.50022</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> </record>
All versions | This version | |
---|---|---|
Views | 1,578 | 1,580 |
Downloads | 287 | 287 |
Data volume | 40.4 GB | 40.4 GB |
Unique views | 1,482 | 1,484 |
Unique downloads | 224 | 224 |