Published December 21, 2016 | Version v1
Dataset Open

read_dataset_german_konzilsprotokolle

Description

This dataset arises from the READ project (Horizon 2020).

Images were provided and enriched under the lead of Dr. Dirk Alvermann (Universitätsarchiv Greifswald - Germany).
All in all this dataset contains 8770 trainscribed textlines of handwritten historical documents from the late 18th century.

Besides the images and page-files (containing geometric textline information and transcripts), lists dividing the dataset in train and test data are provided (each list element contains the corresponding image, textregion and textline identifiers and therefore an explicit mapping of a list element to a textline is possible). Furthermore sublists of the train list are given.
 

Files

Files (5.9 GB)

Name Size Download all
md5:ab7c6dc3e4966405eddb6ea5a356e364
5.9 GB Download

Additional details

Funding

READ – Recognition and Enrichment of Archival Documents 674943
European Commission