Dataset Open Access
A hand is usually considered as a unique characteristic of a person. However, it may slightly change over their whole lifespan. This change might be due to some physical or mental issues. To the best of our knowledge, there is no dataset available, which covers this aspect of evolvement of handwriting of a single person.
When dealing with archival documents, it is important to show that methods are invariant against these changes or investigate how much of these changes are covered. Thus, a new dataset was created with data of the Passau Diocesan Archives (ABP, https://www.bistum-passau.de/bistum/archiv ).
The documents originate from death records of different villages or towns in the Diocese of Passau. Usually the writer of these records (mostly the priest) remains the same over several years. In total, the dataset consists of 1766 pages, which originate from 28 different writers. The number of pages per writer varies from 7 up to 311. For some writers, we only have data from 3 different years, whereas the largest time span between two documents of the same writer is 31 years.
The dataset is organized as follows:
The corresponding PAGE XML file is provided along with the dataset and contains the regions of the image where text is included. This file can be used to calculate features of the writer solely on the handwriting and not on the table lines.
Currently no research tasks are defined on the dataset; we leave this up to the community. Drop us a note how you are using this dataset.