Fraktur model trained from enhanced Austrian Newspapers dataset

Weil, Stefan; Kamlah, Jan; Schmidt, Thomas

doi:10.5281/zenodo.7933402

Published April 26, 2023 | Version 2

Other Open

1. University of Mannheim

Text recognition model for 19th-century German Fraktur texts, trained with ground truth from Austrian newspapers.

The ground truth used for training is available from Mannheim University Library, together with details on the training process: https://github.com/UB-Mannheim/AustrianNewspapers/

It is based on this dataset:

Günter Mühlberger, & Günter Hackl. (2019). NewsEye / READ OCR training dataset from Austrian Newspapers (19th C.) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3387369

Notes

Improved model based on corrected ground truth.

Files (16.3 MB)

Name	MD5	Size
austriannewspapers.mlmodel	acb1a9c733248f5a77faf83d12571940	16.3 MB
metadata.json	d11eb2ad95d433f01c2003672b7ea29e	1.8 kB
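
The MD5 checksums in the file listing can be used to check that a download is complete and uncorrupted. The following is a minimal sketch, assuming the two files were saved under their listed names in the current directory; the function names `md5_of_file` and `verify_download` are illustrative and not part of the record.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "austriannewspapers.mlmodel": "acb1a9c733248f5a77faf83d12571940",
    "metadata.json": "d11eb2ad95d433f01c2003672b7ea29e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 matches the checksum listed for its name."""
    name = Path(path).name
    if name not in expected:
        raise KeyError(f"no checksum listed for {name}")
    return md5_of_file(path) == expected[name]


if __name__ == "__main__":
    # Assumption: the files sit in the current working directory.
    for name in EXPECTED_MD5:
        if Path(name).exists():
            print(name, "OK" if verify_download(name) else "MISMATCH")
        else:
            print(name, "not found in current directory")
```

A mismatch usually points to an interrupted or altered download, in which case fetching the file again is the simplest remedy.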

Additional details

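Funding: Deutsche Forschungsgemeinschaft (DFG), project no. 460547474
Workflow für werkspezifisches Training auf Basis generischer Modelle mit OCR-D sowie Ground-Truth-Aufwertung (Workflow for work-specific training based on generic models with OCR-D, and ground-truth enhancement)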

	All versions	This version
Views	1,472	976
Downloads	5,776	4,265
Data volume	8.3 GB	5.3 GB

Resource type: Other
Publisher: Zenodo

License: Creative Commons Attribution Share Alike 4.0 International

Permits almost any use subject to providing credit and license notice. Frequently used for media assets and educational materials. The most common license for Open Access scientific publications. Not recommended for software.

Technical metadata

Created: May 13, 2023
Modified: March 23, 2026
