Dataset Open Access
Thoma, Martin
{ "files": [ { "links": { "self": "https://zenodo.org/api/files/4acf2c83-52f2-4a45-9b6c-75cb3a34b9da/2015-01-28-data.tar" }, "checksum": "md5:2bf1d089ce65c0a39e57064516f1bd1c", "bucket": "4acf2c83-52f2-4a45-9b6c-75cb3a34b9da", "key": "2015-01-28-data.tar", "type": "tar", "size": 140790596 } ], "owners": [ 5396 ], "doi": "10.5281/zenodo.50022", "stats": { "version_unique_downloads": 224.0, "unique_views": 1476.0, "views": 1571.0, "version_views": 1569.0, "unique_downloads": 224.0, "version_unique_views": 1474.0, "volume": 40406901052.0, "version_downloads": 287.0, "downloads": 287.0, "version_volume": 40406901052.0 }, "links": { "doi": "https://doi.org/10.5281/zenodo.50022", "latest_html": "https://zenodo.org/record/50022", "bucket": "https://zenodo.org/api/files/4acf2c83-52f2-4a45-9b6c-75cb3a34b9da", "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.50022.svg", "html": "https://zenodo.org/record/50022", "latest": "https://zenodo.org/api/records/50022" }, "created": "2016-04-27T06:43:07+00:00", "updated": "2020-01-24T19:26:05.595960+00:00", "conceptrecid": "632739", "revision": 14, "id": 50022, "metadata": { "access_right_category": "success", "doi": "10.5281/zenodo.50022", "description": "<p>The HWRT database of handwritten symbols contains on-line data of handwritten symbols such as all alphanumeric characters, arrows, greek characters and mathematical symbols like the integral symbol.</p>\n\n<p>The database can be downloaded in form of bzip2-compressed tar files. Each tar file contains:</p>\n\n<ul>\n\t<li>symbols.csv: A CSV file with the rows symbol_id, latex, training_samples, test_samples. The symbol id is an integer, the row latex contains the latex code of the symbol, the rows training_samples and test_samples contain integers with the number of labeled data.</li>\n\t<li>train-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.</li>\n\t<li>test-data.csv: A CSV file with the rows symbol_id, user_id, user_agent and data.</li>\n</ul>\n\n<p>All CSV files use \";\" as delimiter and \"'\" as quotechar. The data is given in YAML format as a list of lists of dictinaries. Each dictionary has the keys \"x\", \"y\" and \"time\". (x,y) are coordinates and time is the UNIX time.</p>\n\n<p>\u00a0</p>\n\n<p>About 90% of the data was made available by Daniel Kirsch via github.com/kirel/detexify-data. Thank you very much, Daniel!</p>", "license": { "id": "ODbL-1.0" }, "title": "HWRT database of handwritten symbols", "relations": { "version": [ { "count": 1, "index": 0, "parent": { "pid_type": "recid", "pid_value": "632739" }, "is_last": true, "last_child": { "pid_type": "recid", "pid_value": "50022" } } ] }, "communities": [ { "id": "computer-vision" } ], "keywords": [ "symbol", "LaTeX", "mathematics", "pattern recognition", "machine learning", "on-line recognition" ], "publication_date": "2015-01-28", "creators": [ { "affiliation": "Karlsruhe Institute of Technology", "name": "Thoma, Martin" } ], "access_right": "open", "resource_type": { "type": "dataset", "title": "Dataset" }, "related_identifiers": [ { "scheme": "url", "identifier": "http://www.martin-thoma.de/write-math/data/", "relation": "isSupplementTo" }, { "scheme": "url", "identifier": "https://zenodo.org/record/259444", "relation": "compiles" } ] } }
All versions | This version | |
---|---|---|
Views | 1,569 | 1,571 |
Downloads | 287 | 287 |
Data volume | 40.4 GB | 40.4 GB |
Unique views | 1,474 | 1,476 |
Unique downloads | 224 | 224 |