Crowdsourcing Online Handwriting Acquisition to Develop and Deploy a Unicode Character Classifier

Daniel Martín-Albo; Francisco Álvaro

doi:10.1109/ICFHR-2018.2018.00051

Published August 8, 2018 | Version v1

Conference paper | Open access

1. Wiris

There are thousands of Unicode characters, so it can be hard to find a particular one visually. For this reason, we aimed to develop a tool that lets users handwrite a character and receive a list of the candidates most similar to that input. This tool will be integrated into a math editor that handles more than 5,000 different Unicode characters. Since no public datasets were found to fit our needs, we crowdsourced the acquisition of online handwritten data for training purposes. We developed a neural network combining convolutional layers with shape-based features to classify online handwritten Unicode characters. To make the model more robust to input variability, we used data augmentation in the form of affine transformations. We achieved a top-20 error rate of 12.64% on validation data and received positive feedback from users, thus validating that crowdsourcing is a suitable method for online handwriting acquisition. Finally, we deployed the model wrapped in a JSON-based REST API and released a public demo using it. This way, we present the full development cycle of a Unicode character classifier.
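
The abstract mentions two ideas that a short sketch may make more concrete: affine augmentation of online handwriting and the top-20 error rate. The code below is only an illustration under stated assumptions, not the authors' implementation. The stroke representation (a list of strokes, each a list of (x, y) points), the function names, and the perturbation ranges are all hypothetical; the abstract does not specify them.

```python
import math
import random


def random_affine(max_rotation_deg=10.0, max_scale=0.15, max_shear=0.15, rng=random):
    """Draw a random 2x2 linear map (rotation * shear * scale).

    The default ranges are illustrative assumptions, not values from the paper.
    """
    theta = math.radians(rng.uniform(-max_rotation_deg, max_rotation_deg))
    sx = 1.0 + rng.uniform(-max_scale, max_scale)
    sy = 1.0 + rng.uniform(-max_scale, max_scale)
    sh = rng.uniform(-max_shear, max_shear)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Product of R = [[cos, -sin], [sin, cos]] and [[sx, sh * sy], [0, sy]].
    a = cos_t * sx
    b = cos_t * sh * sy - sin_t * sy
    c = sin_t * sx
    d = sin_t * sh * sy + cos_t * sy
    return a, b, c, d


def augment(strokes, matrix):
    """Apply the 2x2 map about the centroid of all points, keeping the stroke structure."""
    points = [p for stroke in strokes for p in stroke]
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    a, b, c, d = matrix
    return [
        [
            (cx + a * (x - cx) + b * (y - cy), cy + c * (x - cx) + d * (y - cy))
            for x, y in stroke
        ]
        for stroke in strokes
    ]


def top_k_error(ranked_predictions, labels, k=20):
    """Fraction of samples whose true label is missing from the k best-ranked candidates."""
    misses = sum(
        1 for ranked, label in zip(ranked_predictions, labels) if label not in ranked[:k]
    )
    return misses / len(labels)


if __name__ == "__main__":
    # A hypothetical two-stroke sample, e.g. a rough "+" sign.
    sample = [[(0.0, 1.0), (2.0, 1.0)], [(1.0, 0.0), (1.0, 2.0)]]
    distorted = augment(sample, random_affine(rng=random.Random(0)))
    print(distorted)
```

Applying the transformation about the centroid keeps the character in place while changing its slant, size, and orientation, which is one plausible way to simulate variation between writers. Under this definition, a top-20 error rate of 12.64% means that the correct character was missing from the 20 highest-ranked candidates for 12.64% of the validation samples.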

Files

Name	Size
icfhr2018_final.pdf (md5:d7239396b328b0ac37c52a40afd14e03)	909.2 kB

Additional details

Funding: iMuSciCA – Interactive Music Science Collaborative Activities, grant 731861, European Commission

	All versions	This version
Views	46	45
Downloads	200	199
Data volume	186.4 MB	185.5 MB
