Published September 9, 2024 | Version v1
Other Open

Model trained on 11th century manuscripts to produce expanded transcription (Latin).

  • 1. Universität Kassel

Description

This model has been trained as part of the ongoing edition project Burchards Dekret Digital (www.burchards-dekret-digital.de), funded by the Academy of Sciences and Literature Mainz. It is the project's first high-quality model specifically designed to produce a normalized transcription. The model was trained on three 11th-century manuscripts that can be traced to the episcopal scriptorium in Worms: Bamberg, SB, Msc.Can.6 (https://mdz-nbn-resolving.de/urn:nbn:de:bvb:12-bsb00140701-0), Frankfurt, UB, Ms. Barth. 50 (https://sammlungen.ub.uni-frankfurt.de/msma/urn/urn:nbn:de:hebis:30:2-12488) and Vatican, BAV, Pal.lat.585 (https://digi.vatlib.it/mss/detail/Pal.lat.585). However, it also works well as a base model for later medieval scripts. The model was trained by Dr. Michael Schonhardt (Universität Kassel, https://orcid.org/0000-0002-2750-1900). Transcriptions were provided and proofread by Helena Geitz, Daniel Gneckow, Dr. Andreas Grote, Prof. Dr. Lotte Kéry, Dr. Birgit Kynast, Dr. Hans-Christian Lehner, Dr. Melanie Panse-Buchwalter, Michaela Parma, Dr. Cornelia Scherer, Dr. Michael Schonhardt and Dr. des. Elena Vanelli. The project is led by Prof. Dr. Ingrid Baumgärtner, Prof. Dr. Klaus Herbers and Prof. Dr. Ludger Körntgen. The model was trained in 47 epochs using a learning rate of 0.0008 and a batch size of 64.

Files

metadata.json

Files (16.1 MB)

Name Size Download all
md5:e3e3ba1dca7f65c6685cdd8f47d73be9
16.1 MB Download
md5:e45721ed3ac23efe725caf5eeda5ba85
2.2 kB Preview Download