Published September 21, 2025 | Version v1
Dataset Open

CROHME Contextual Dataset

  • 1. ROR icon Laboratoire des Sciences du Numérique de Nantes
  • 2. ROR icon Nantes Université

Description

This dataset supports CROHME2019 [1] and CROHME2023 [2] Testset with contextual information. It provides aligned LaTeX expressions extracted from source documents, along with surrounding textual context, to facilitate formula recognition experiments. The paper is publised in ICDAR 2025 M3RD Workshop [3].

The dataset contains two main components: the original source files and processed JSON data aligned with CROHME test sets.

Files

README.md

Files (323.0 MB)

Name Size Download all
md5:9edb7849da74a3ab7e720600be7e96e5
2.1 MB Preview Download
md5:c8a8d727f8c6b4c9cf41f72341d4265c
4.4 MB Preview Download
md5:c0739071a79bc77190fc9cb1d0ba6ee3
3.1 kB Preview Download
md5:21eb6a9fee39be05b128be980e5c7fc1
316.5 MB Preview Download

Additional details

Related works

Is described by
Conference paper: https://hal.science/hal-05252349 (URL)