Project deliverable Open Access

Methods for Automated Text Digitisation

Owen David; Groom Quentin; Hardisty Alex; Leegwater Thijs; van Walsum Myriam; Wijkamp Noortje; Spasić Irena

Project member(s)
Dillen Mathias; Livermore Laurence; Phillips Sarah; Wu Zhengzhe

In this document we describe an effective approach to the automated digitisation of text on specimen labels. These labels contain much useful data about the specimen, including its collector, country of origin and collection date. Our approach to extracting these data automatically takes the form of a pipeline. We make recommendations for the pipeline’s component parts, drawing on current state-of-the-art technologies.

Files
Deliverable D4.1 ICEDIG - Methods for Automated Text Digitisation.pdf (3.5 MB)