Published May 7, 2026 | Version v1
Poster Open

Beyond Archives: Designing a Segmentation and ATR Workflow for Mid-Twentieth-Century Typescripts in a Contested-Memory Case Study

Authors/Creators

  • 1. ROR icon University of Naples - L'Orientale

Description

Abstract of the poster “Beyond Archives: Designing a Segmentation and ATR Workflow for Mid-Twentieth-Century Typescripts in a Contested-Memory Case Study”, presented at the “DARIAH Annual Event 2026: Digital Arts and Humanities With and For Society: Building Infrastructures of Engagement” in Rome, Italy, 26–29 May 2026.
Poster ID: 125.

The Beyond Archives project aims to develop a relational digital library for memory sources in contested historical settings, connecting institutional records and oral testimonies while keeping provenance, access conditions, and narrative contexts readable. The poster presents the written-stream workflow of the project, focusing on layout segmentation and Automatic Text Recognition (ATR) for mid-twentieth-century Italian typescripts from the Prefecture of Naples records. It shows how PAGE/ALTO outputs, TEI-oriented structuring, controlled correction, and paradata documentation can support reusable access while keeping model performance, residual errors, and remaining limitations inspectable.

Files

DARIAH_Lembo_poster.pdf

Files (22.2 MB)

Name Size Download all
md5:d844eb56dfe1c0843b694708453f57f7
1.1 MB Preview Download
md5:044a7d3d22bd287e9a05202a466463b7
21.1 MB Preview Download