Nuremberg Letterbooks: A Multi-Transcriptional Dataset of Early 15th Century Manuscripts for Document Analysis
Creators
- 1. Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg
- 2. Senior Fellow of Medieval History, Friedrich-Alexander-Universität Erlangen-Nürnberg
- 3. Department of German Linguistics, Friedrich-Alexander-Universität Erlangen-Nürnberg
- 4. Chair of Regional History of Bavaria and Franconia, Friedrich-Alexander-Universität Erlangen-Nürnberg
Description
This dataset contains the images and labels of the Nuremberg Letterbooks dataset.
It consists of four books (books 2 - 5) with line-wise transcriptions. Three kinds of transcriptions are reported: basic, regularized, and diplomatic, with additional expanded abbreviations.
Code templates for text verification and writer verification are available at:
- https://github.com/M4rt1nM4yr/letterbooks_text_verification
- https://github.com/M4rt1nM4yr/letterbooks_writer_verification
When using this dataset, please cite:
M. Mayr, J. Krenz, K. Neumeier, A. Bub, S. Bürcky, N. Brolich, K. Herbers, M. Habermann, P. Fleischmann, A. Maier, and V. Christlein.
Nuremberg Letterbooks: A Multi-Transcriptional Dataset of Early 15th Century Manuscripts for Document Analysis. Sci Data 12, 811 (2025).
https://doi.org/10.1038/s41597-025-05144-z
Files
images.zip
Files
(2.6 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:b3698ae828da4632121165d331628394
|
2.3 GB | Preview Download |
|
md5:ce2c6150d9fc45ac4b4ea2a439b7aa8e
|
262.2 MB | Preview Download |
Additional details
Funding
- Deutsche Forschungsgemeinschaft
- Kommunikation und Sprache im Reich. Die Nürnberger Briefbücher im 15. Jahrhundert: Automatische Handschriftenerkennung - historische und sprachwissenschaftliche Analyse 416910787