Published January 31, 2019 | Version v1
Project deliverable Open

Methods for Automated Text Digitisation

  • 1. School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom
  • 2. Meise Botanic Garden, Meise, Belgium
  • 3. Picturae BV, Heiloo, Netherlands
  • 1. Meise Botanic Garden, Meise, Belgium
  • 2. The Natural History Museum, London, United Kingdom
  • 3. Royal Botanic Gardens, Kew, United Kingdom
  • 4. Finnish Museum of Natural History, LUOMUS, Helsinki, Finland

Description

In this document we describe an effective approach to automated text digitisation with respect to specimen labels. These labels contain much useful data about the specimen including its collector, country of origin and collection date. Our approach to automatically extracting these data takes the form of a pipeline. Recommendations are made for the pipeline’s component parts based on some of the state-of-the-art technologies.

Files

Deliverable D4.1 ICEDIG - Methods for Automated Text Digitisation.pdf

Files (3.5 MB)

Additional details

Funding

ICEDIG – Innovation and consolidation for large scale digitisation of natural heritage 777483
European Commission