Dataset Open Access

HWRT database of handwritten symbols

Thoma, Martin


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.50022</identifier>
  <creators>
    <creator>
      <creatorName>Thoma, Martin</creatorName>
      <givenName>Martin</givenName>
      <familyName>Thoma</familyName>
      <affiliation>Karlsruhe Institute of Technology</affiliation>
    </creator>
  </creators>
  <titles>
    <title>HWRT database of handwritten symbols</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2015</publicationYear>
  <subjects>
    <subject>symbol</subject>
    <subject>LaTeX</subject>
    <subject>mathematics</subject>
    <subject>pattern recognition</subject>
    <subject>machine learning</subject>
    <subject>on-line recognition</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2015-01-28</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/50022</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsSupplementTo">http://www.martin-thoma.de/write-math/data/</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="Compiles">https://zenodo.org/record/259444</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/computer-vision</relatedIdentifier>
  </relatedIdentifiers>
  <rightsList>
    <rights rightsURI="http://www.opendatacommons.org/licenses/odbl/1.0/">ODC Open Database License v1.0</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;The HWRT database of handwritten symbols contains on-line data of handwritten symbols such as all alphanumeric characters, arrows, greek characters and mathematical symbols like the integral symbol.&lt;/p&gt;

&lt;p&gt;The database can be downloaded in form of bzip2-compressed tar files. Each tar file contains:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;symbols.csv: A CSV file with the rows symbol_id, latex, training_samples, test_samples. The symbol id is an integer, the row latex contains the latex code of the symbol, the rows training_samples and test_samples contain integers with the number of labeled data.&lt;/li&gt;
	&lt;li&gt;train-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.&lt;/li&gt;
	&lt;li&gt;test-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All CSV files use ";" as delimiter and "'" as quotechar. The data is given in YAML format as a list of lists of dictinaries. Each dictionary has the keys "x", "y" and "time". (x,y) are coordinates and time is the UNIX time.&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;

&lt;p&gt;About 90% of the data was made available by Daniel Kirsch via github.com/kirel/detexify-data. Thank you very much, Daniel!&lt;/p&gt;</description>
  </descriptions>
</resource>
1,139
201
views
downloads
All versions This version
Views 1,1391,141
Downloads 201201
Data volume 28.3 GB28.3 GB
Unique views 1,0831,085
Unique downloads 162162

Share

Cite as