Dataset Open Access
Thoma, Martin
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"> <identifier identifierType="DOI">10.5281/zenodo.50022</identifier> <creators> <creator> <creatorName>Thoma, Martin</creatorName> <givenName>Martin</givenName> <familyName>Thoma</familyName> <affiliation>Karlsruhe Institute of Technology</affiliation> </creator> </creators> <titles> <title>HWRT database of handwritten symbols</title> </titles> <publisher>Zenodo</publisher> <publicationYear>2015</publicationYear> <subjects> <subject>symbol</subject> <subject>LaTeX</subject> <subject>mathematics</subject> <subject>pattern recognition</subject> <subject>machine learning</subject> <subject>on-line recognition</subject> </subjects> <dates> <date dateType="Issued">2015-01-28</date> </dates> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/50022</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="URL" relationType="IsSupplementTo">http://www.martin-thoma.de/write-math/data/</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="Compiles">https://zenodo.org/record/259444</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/computer-vision</relatedIdentifier> </relatedIdentifiers> <rightsList> <rights rightsURI="http://www.opendatacommons.org/licenses/odbl/1.0/">ODC Open Database License v1.0</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p>The HWRT database of handwritten symbols contains on-line data of handwritten symbols such as all alphanumeric characters, arrows, greek characters and mathematical symbols like the integral symbol.</p> <p>The database can be downloaded in form of bzip2-compressed tar files. Each tar file contains:</p> <ul> <li>symbols.csv: A CSV file with the rows symbol_id, latex, training_samples, test_samples. The symbol id is an integer, the row latex contains the latex code of the symbol, the rows training_samples and test_samples contain integers with the number of labeled data.</li> <li>train-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.</li> <li>test-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.</li> </ul> <p>All CSV files use ";" as delimiter and "'" as quotechar. The data is given in YAML format as a list of lists of dictinaries. Each dictionary has the keys "x", "y" and "time". (x,y) are coordinates and time is the UNIX time.</p> <p> </p> <p>About 90% of the data was made available by Daniel Kirsch via github.com/kirel/detexify-data. Thank you very much, Daniel!</p></description> </descriptions> </resource>
All versions | This version | |
---|---|---|
Views | 1,578 | 1,580 |
Downloads | 287 | 287 |
Data volume | 40.4 GB | 40.4 GB |
Unique views | 1,482 | 1,484 |
Unique downloads | 224 | 224 |