Published February 17, 2019 | Version v4
Dataset Open

ICDAR 2019 Competition on Baseline Detection (cBAD)

  • 1. Computer Vision Lab, TU Wien
  • 2. National Center for Scientific Research "Demokritos"

Description

This dataset contains the training, evaluation, and test sets for the ICDAR 2019 Competition on Baseline Detection (cBAD).

cBAD is based on a newly created, freely available real-world dataset of 3021 annotated document page images collected from seven European archives. The baselines in all images were annotated manually. The training and evaluation sets contain PAGE XML files with annotated text regions and baselines.
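As an illustration of how the baseline annotations might be read, the following minimal sketch extracts baseline polylines from a PAGE XML document using only the Python standard library. The embedded sample document, its schema version, and the helper names (local_name, parse_points, read_baselines) are assumptions made for this example; the files in the archive may differ in schema version and carry additional elements.

```python
import xml.etree.ElementTree as ET

# Hypothetical minimal PAGE XML document, written for this example only.
# Real files in the archive may use another schema version and more elements.
SAMPLE_PAGE_XML = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="example.jpg" imageWidth="1000" imageHeight="800">
    <TextRegion id="r1">
      <Coords points="10,10 500,10 500,200 10,200"/>
      <TextLine id="l1">
        <Baseline points="20,50 250,52 480,49"/>
      </TextLine>
      <TextLine id="l2">
        <Baseline points="20,120 480,118"/>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def parse_points(points_attr):
    """Turn 'x1,y1 x2,y2 ...' into a list of (x, y) integer tuples."""
    return [tuple(int(v) for v in pair.split(",")) for pair in points_attr.split()]


def read_baselines(xml_text):
    """Return one list of (x, y) points per Baseline element, in document order."""
    root = ET.fromstring(xml_text)
    baselines = []
    for element in root.iter():
        # Match on the local name so the code does not depend on the schema version.
        if local_name(element.tag) == "Baseline":
            baselines.append(parse_points(element.get("points", "")))
    return baselines


if __name__ == "__main__":
    for index, baseline in enumerate(read_baselines(SAMPLE_PAGE_XML)):
        print(index, baseline)
```

To process a file from the extracted archive, read it as text and pass the contents to read_baselines. Matching on the local tag name rather than a fixed namespace is a deliberate choice here, since the exact schema version used by the dataset is not stated above.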

Competition Website: https://scriptnet.iit.demokritos.gr/competitions/11/

Files

READ-ICDAR2019-cBAD-dataset.zip (4.7 GB)
md5:17f28f57b1239af40a8ef7cd80101059
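
The listed MD5 checksum can be used to confirm that a download completed without corruption. The sketch below is one way to do this in Python; it assumes the archive has been saved locally, and the function names md5_of_file and verify are illustrative choices, not part of any tooling shipped with the dataset.

```python
import hashlib

# Checksum as listed for READ-ICDAR2019-cBAD-dataset.zip.
EXPECTED_MD5 = "17f28f57b1239af40a8ef7cd80101059"


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in chunks
    so that a multi-gigabyte archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, verify("READ-ICDAR2019-cBAD-dataset.zip") returns True only if the local copy matches the published checksum; the path is an assumption about where the file was saved.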

Additional details

Funding

European Commission: READ – Recognition and Enrichment of Archival Documents (grant no. 674943)