Published October 10, 2023 | Version v1
Dataset Open

ICDAR 2023 CROHME: Competition on Recognition of Handwritten Mathematical Expressions

  • 1. Nantes Université, LS2N
  • 2. Luleå University of Technology
  • 3. Tokyo University of Agriculture and Technology
  • 4. FPT University

Description

These are the datasets collected for the Competition on Recognition of Online Handwritten Mathematical Expressions (CROHME), held in the competition session of ICDAR 2023.  
Three tasks are proposed, one per input modality: on-line, off-line, and bi-modal.  
For the on-line task, we provide .inkml files (containing trace information plus MathML and LaTeX strings), with symbol-level label graphs (SymLG) as ground truth. In addition to the new data and the previous CROHME data, the training set includes a large amount of artificial on-line data.  
For the off-line task, we provide .png images (scanned from paper or rendered from InkML) together with symbol-level label graphs (SymLG). In addition to the new data and the previous CROHME data, off-line images from OffHME are used to enlarge the training set.  
For the bi-modal task, both the .inkml files and the .png images are provided as two input channels, with SymLG as ground truth.  

All three tasks inherit the data collected for the six previous CROHME competitions and add the new 2023 collection, gathered at three sites: Nantes (France), Luleå (Sweden), and Tokyo (Japan).
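
As an illustration of the on-line format described above, the following is a minimal sketch of reading trace points and a LaTeX ground-truth string from an InkML document with the Python standard library. The embedded sample is hand-written for this example and is not taken from the dataset. The sketch also assumes that the LaTeX string is stored in an `annotation` element with `type="truth"`, which should be checked against the actual files. The names `parse_inkml` and `SAMPLE_INKML` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C InkML recommendation.
INKML_NS = {"ink": "http://www.w3.org/2003/InkML"}

# Hand-written sample for illustration only (not taken from CROHME23.zip).
SAMPLE_INKML = """<ink xmlns="http://www.w3.org/2003/InkML">
  <annotation type="truth">$a+b$</annotation>
  <trace id="0">10 20, 11 22, 13 25</trace>
  <trace id="1">30 20, 30 30</trace>
</ink>"""


def parse_inkml(text):
    """Return (traces, latex) parsed from an InkML string.

    traces maps each trace id to a list of (x, y) float pairs.
    latex is the text of the annotation with type="truth", or None.
    Assumption: the ground-truth LaTeX lives in that annotation.
    """
    root = ET.fromstring(text)

    traces = {}
    for trace in root.findall("ink:trace", INKML_NS):
        points = []
        for point in (trace.text or "").split(","):
            coords = point.split()
            if len(coords) < 2:
                continue  # skip empty fragments such as a trailing comma
            # Keep only x and y; extra channels (e.g. time) are ignored.
            points.append((float(coords[0]), float(coords[1])))
        traces[trace.get("id")] = points

    latex = None
    for annotation in root.findall("ink:annotation", INKML_NS):
        if annotation.get("type") == "truth" and annotation.text:
            latex = annotation.text.strip()

    return traces, latex
```

Calling `parse_inkml(SAMPLE_INKML)` returns the two sample strokes keyed by trace id together with the sample LaTeX string. For real files, reading the file contents and passing them to the same function should work under the stated assumptions.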

Notes

Tools:

  • CROHMElib (CROHME data conversion tools and viewer): https://gitlab.univ-nantes.fr/crohme/crohmelib
  • Lgeval (Label Graph Evaluation tools): https://gitlab.com/dprl/lgeval

Files

CROHME23.zip (1.8 GB)
md5:9eb899ddb0e87c80a17f489bf6162c33
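
The MD5 digest listed for CROHME23.zip can be used to check that a download is complete. The following is a minimal sketch using only the Python standard library. The function names `md5_of_file` and `matches_published_md5` are hypothetical, and the archive is assumed to be saved locally under its original name.

```python
import hashlib

# Digest published on this record for CROHME23.zip.
PUBLISHED_MD5 = "9eb899ddb0e87c80a17f489bf6162c33"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the hex MD5 digest of a file, reading it in chunks
    so that a 1.8 GB archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=PUBLISHED_MD5):
    """Return True if the file at `path` has the expected MD5 digest."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_published_md5("CROHME23.zip")` should return `True` for an intact download and `False` for a truncated or corrupted one.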

Additional details

Related works

Is published in
Conference paper: 10.1007/978-3-031-41679-8_33 (DOI)

References

  • Xie, Yejing, et al. "ICDAR 2023 CROHME: Competition on Recognition of Handwritten Mathematical Expressions." International Conference on Document Analysis and Recognition. Cham: Springer Nature Switzerland, 2023.
  • Truong, Thanh-Nghia, Cuong Tuan Nguyen, and Masaki Nakagawa. "Syntactic data generation for handwritten mathematical expression recognition." Pattern Recognition Letters 153 (2022): 83-91.
  • Wang, Da-Han, et al. "ICFHR 2020 competition on offline recognition and spotting of handwritten mathematical expressions-OffRaSHME." 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE, 2020.