Dataset Open Access

HWRT database of handwritten symbols

Thoma, Martin


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">symbol</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">LaTeX</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">mathematics</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">pattern recognition</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">machine learning</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">on-line recognition</subfield>
  </datafield>
  <controlfield tag="005">20200124192605.0</controlfield>
  <controlfield tag="001">50022</controlfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">140790596</subfield>
    <subfield code="z">md5:2bf1d089ce65c0a39e57064516f1bd1c</subfield>
    <subfield code="u">https://zenodo.org/record/50022/files/2015-01-28-data.tar</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2015-01-28</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-computer-vision</subfield>
    <subfield code="o">oai:zenodo.org:50022</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Karlsruhe Institute of Technology</subfield>
    <subfield code="a">Thoma, Martin</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">HWRT database of handwritten symbols</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-computer-vision</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://www.opendatacommons.org/licenses/odbl/1.0/</subfield>
    <subfield code="a">ODC Open Database License v1.0</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;The HWRT database of handwritten symbols contains on-line data of handwritten symbols, including all alphanumeric characters, arrows, Greek characters, and mathematical symbols such as the integral symbol.&lt;/p&gt;

&lt;p&gt;The database can be downloaded in the form of bzip2-compressed tar files. Each tar file contains:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;symbols.csv: A CSV file with the columns symbol_id, latex, training_samples and test_samples. symbol_id is an integer, latex contains the LaTeX code of the symbol, and training_samples and test_samples are integers giving the number of labeled samples.&lt;/li&gt;
	&lt;li&gt;train-data.csv: A CSV file with the columns symbol_id, user_id, user_agent and data.&lt;/li&gt;
	&lt;li&gt;test-data.csv: A CSV file with the columns symbol_id, user_id, user_agent and data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All CSV files use ";" as the delimiter and "'" as the quote character. The data column is given in YAML format as a list of lists of dictionaries. Each dictionary has the keys "x", "y" and "time"; (x, y) are coordinates and time is the UNIX time.&lt;/p&gt;

&lt;p&gt;About 90% of the data was made available by Daniel Kirsch via github.com/kirel/detexify-data. Thank you very much, Daniel!&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">url</subfield>
    <subfield code="i">isSupplementTo</subfield>
    <subfield code="a">http://www.martin-thoma.de/write-math/data/</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">url</subfield>
    <subfield code="i">compiles</subfield>
    <subfield code="a">https://zenodo.org/record/259444</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.50022</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
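
The description in the record above says that the CSV files use ";" as the delimiter and "'" as the quote character. A minimal sketch of reading such a file with Python's standard `csv` module might look like the following. The sample row, its values, and the presence of a header row are assumptions made for illustration and are not taken from the dataset; `read_rows` is a hypothetical helper name.

```python
import csv
import io

# Hypothetical sample in the column layout described for train-data.csv.
# The header row and every value here are assumptions for illustration.
SAMPLE = (
    "symbol_id;user_id;user_agent;data\n"
    "31;10;'Mozilla/5.0 (X11; Linux x86_64)';"
    "'[[{\"x\": 12, \"y\": 34, \"time\": 1422403200}]]'\n"
)


def read_rows(handle):
    """Return a list of dicts, one per CSV row, keyed by the header names."""
    # ";" separates fields; "'" quotes fields that contain ";" themselves.
    reader = csv.DictReader(handle, delimiter=";", quotechar="'")
    return list(reader)


rows = read_rows(io.StringIO(SAMPLE))
first = rows[0]
print(first["symbol_id"], first["user_agent"])
# The data column is still a raw string at this point.
print(first["data"])
```

For a file on disk, the same helper could be called on `open(path, newline="", encoding="utf-8")`, where the encoding is an assumption. The `data` column stays a string here; turning it into the list of lists of dictionaries would need a YAML parser such as PyYAML's `yaml.safe_load`, which is left out so that the sketch depends only on the standard library.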