There is a newer version of this record available.

Dataset Open Access

ICDAR 2019 Competition on Baseline Detection (cBAD)

Diem Markus; Kleber Florian; Gatos Basilis

This dataset contains the training, evaluation, and test set for the ICDAR 2019 Competition on Baseline Detection (cBAD).

A newly created freely available real world dataset consisting of 3021 annotated document page images that are collected from seven European archives and form the basis of cBAD. The baselines in all images were manually annotated. The training and the evaluation sets contain PAGE XMLs with annotated text regions and baselines. The groundtruth for the test set will be published after the competition deadline (May 2019).

Competition Website:

Files (4.7 GB)
Name Size
4.7 GB Download
All versions This version
Views 1,4141,158
Downloads 1,7991,467
Data volume 8.4 TB6.8 TB
Unique views 1,2141,046
Unique downloads 632554


Cite as