Published June 9, 2023 | Version 1
Dataset Open

Dataset for Paper: Text Line Detection and Recognition of Greek Polytonic Documents

  • 1. Institute of Informatics and Telecommunications, National Centre for Scientific Research "Demokritos"
  • 2. Department of English Studies, University of Cyprus

Description

Dataset for the paper: "Text Line Detection and Recognition of Greek Polytonic Documents", P. Kaddas, B. Gatos, K. Palaiologos, K. Christopoulou and K. Kritsis, 4th Workshop on Machine Learning (WML), San Jose, California, USA.

We introduce a new dataset, GTLD-small, containing annotated text line quadrilateral polygons for 1,642 document images. The annotations cover images drawn from three existing datasets: Tobacco-3482, PIOP and ShakeIT.

Overview of the collections included in this work and the number of images used for training, validation and testing.

Collection            Total   Train   Val   Test
PIOP-small              950     672    90    188
ShakeIT-small           357     264    27     66
Tobacco-3482-small      335     240    30     65
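The split counts in the overview can be cross-checked against the stated total of 1,642 images. The snippet below is a minimal sketch: the numbers are copied from the overview, while the `SPLITS` dictionary and the function names are illustrative and not part of the dataset itself.

```python
# Split sizes copied from the overview table (images per collection).
SPLITS = {
    "PIOP-small": {"total": 950, "train": 672, "val": 90, "test": 188},
    "ShakeIT-small": {"total": 357, "train": 264, "val": 27, "test": 66},
    "Tobacco-3482-small": {"total": 335, "train": 240, "val": 30, "test": 65},
}


def check_splits(splits):
    """Verify train + val + test == total per collection; return the grand total."""
    for name, counts in splits.items():
        parts = counts["train"] + counts["val"] + counts["test"]
        if parts != counts["total"]:
            raise ValueError(f"{name}: splits sum to {parts}, expected {counts['total']}")
    return sum(counts["total"] for counts in splits.values())


def split_totals(splits):
    """Sum each split (train, val, test) across all collections."""
    return {
        key: sum(counts[key] for counts in splits.values())
        for key in ("train", "val", "test")
    }
```

Across the three collections this gives 1,176 training, 147 validation and 319 test images, which together make up the 1,642 annotated images.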

 

Files

GTLD-small.zip

Size: 3.3 GB
md5:6dd8b2d494bfbc211552460f5985218a
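Since the archive is large, it may be worth verifying the download against the published MD5 checksum before extracting it. The following is a minimal sketch using only the Python standard library. It assumes the archive has been saved locally as `GTLD-small.zip`; the function names are hypothetical helpers, not part of any official tooling for this dataset.

```python
import hashlib

# Checksum as published in the file listing for GTLD-small.zip.
EXPECTED_MD5 = "6dd8b2d494bfbc211552460f5985218a"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat for a multi-gigabyte archive.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def archive_is_intact(path, expected=EXPECTED_MD5):
    """Hypothetical helper: True if the file's MD5 matches the expected value."""
    return md5_of_file(path) == expected.lower()


# Example call (assumes the archive is in the current directory):
# archive_is_intact("GTLD-small.zip")
```

A result of `False` suggests an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.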