Published July 8, 2025 | Version v1
Dataset Open

Finnish migration records 1800–1920 (htr annotation source images)

Description

Source images for the manually annotated HTR dataset of Finnish migration records. The corresponding annotation files can be obtained from https://github.com/TurkuNLP/htr-annotations.

See https://github.com/TurkuNLP/finnish-migration-data for more information about the project.

-------------

LICENSE

These images are obtained from the archives of Finland's Family History Association (Suomen Sukuhistoriallinen Yhdistys ry, https://www.sukuhistoria.fi/sshy/index_eng.htm) with their permission. The images are property of the association, and the association's usage policies apply. You are not allowed to redistribute the images. However, these images can be used to train computer vision models, and trained models can be freely distributed.

For up-to-date usage policy, see FFHA website (https://www.sukuhistoria.fi/sshy/index_eng.htm). The current (8.7.2025) statement says: "Our publicly available material may be used, copied and linked freely. Please note that pictures and other material from or member pages are not to be copied for open use. Also for a publication copying the material, images etc. of member pages is restricted. The source should be mentioned when using our pages (also concerning our publicly available material)."

Files

migration-data-image-release.zip

Files (2.5 GB)

Name Size Download all
md5:1afe12340b4cd22a177f11e51a902851
944 Bytes Download
md5:b06c1cf1080f5d53c2ed8e5706f99ee7
2.5 GB Preview Download