{"authors": [{"name": "Michael Schonhardt", "affiliation": "Universit\u00e4t Kassel"}], "summary": "Model trained on 11th century manuscripts to produce graphematic transcription (Latin).", "description": "This model has been trained as part of the ongoing edition project Burchards Dekret Digital (www.burchards-dekret-digital.de), funded by the Academy of Sciences and Literature Mainz. It is the project's first high-quality model specifically designed to produce a graphematic transcription based on a predefined set of special characters (https://github.com/michaelscho/transpy?tab=readme-ov-file#special-characters) in accordance with the MUFI standard. The model was trained on five 11th-century manuscripts that can be traced to the episcopal scriptorium in Worms: Bamberg, SB, Msc.Can.6 (https://mdz-nbn-resolving.de/urn:nbn:de:bvb:12-bsb00140701-0), Frankfurt, UB, Ms. Barth. 50 (https://sammlungen.ub.uni-frankfurt.de/msma/urn/urn:nbn:de:hebis:30:2-12488), K\u00f6ln, EDD, Cod. 119 (https://digital.dombibliothek-koeln.de/urn/urn:nbn:de:hbz:kn28-3-3241), Vatican, BAV, Pal.lat.585 (https://digi.vatlib.it/mss/detail/Pal.lat.585), and Vatican, BAV, Pal.lat.586 (https://digi.vatlib.it/mss/detail/Pal.lat.586). However, it also works well as a base model for later medieval scripts.\nThe model was trained by Dr. Michael Schonhardt (Universit\u00e4t Kassel, https://orcid.org/0000-0002-2750-1900). Transcriptions were provided and proofread by Helena Geitz, Daniel Gneckow, Dr. Andreas Grote, Prof. Dr. Lotte K\u00e9ry, Dr. Birgit Kynast, Dr. Hans-Christian Lehner, Dr. Melanie Panse-Buchwalter, Michaela Parma, Dr. Cornelia Scherer, Dr. Michael Schonhardt and Dr. des. Elena Vanelli. The project is led by Prof. Dr. Ingrid Baumg\u00e4rtner, Prof. Dr. Klaus Herbers and Prof. Dr. Ludger K\u00f6rntgen.\nThe model was trained in 54 epochs using a learning rate of 0.0008 and a batch size of 64.", "accuracy": 96.4540958404541, "license": "CC-BY-4.0", "script": ["Latn"], "name": "bdd-wormser-scriptorium-abbreviated-0.2.mlmodel", "graphemes": [" ", "#", "*", "+", ".", "/", "0", "1", "2", "3", "4", "5", "6", "7", "8", "9", ":", ";", "<", ">", "A", "B", "C", "D", "E", "F", "G", "H", "I", "J", "K", "L", "M", "N", "O", "P", "Q", "R", "S", "T", "U", "V", "X", "Y", "Z", "[", "]", "a", "b", "c", "d", "e", "f", "g", "h", "i", "j", "k", "l", "m", "n", "o", "p", "q", "r", "s", "t", "u", "v", "x", "y", "z", "\u00ac", "\u00b4", "\u0111", "\u0119", "\u0127", "\u014d", "\u0180", "\u019a", "\u0304", "\u0305", "\u0365", "\u0366", "\u0367", "\u1dd2", "\u1dd3", "\u211e", "\u2234", "\ua741", "\ua748", "\ua749", "\ua750", "\ua751", "\ua752", "\ua753", "\ua756", "\ua757", "\ua759", "\ua75d", "\ue665", "\ue8b3", "\uf160", "\uf1ac", "\uf1c2", "\uf1e1", "\uf1e4", "\uf1ea", "\uf1f0", "\uf1f8"]}