Published April 22, 2026 | Version v1
Conference paper Open

Narrating Belgian Business History

  • 1. Ghent University
  • 2. Ghent Centre for Digital Humanities
  • 3. University of Antwerp
  • 4. Royal Library of Belgium

Description

This paper presents BELHISFIRM, a digital humanities project that transforms over two centuries of Belgian corporate records from the Belgian Official Gazette into structured, machine-readable research data. The project builds a six-stage pipeline (digitization, automatic text recognition, layout classification, named entity recognition, record linkage, and linked data publication) that converts millions of archival fragments into coherent firm histories and networks. A key methodological focus is the evaluation of vision-language models against traditional OCR on the complex, irregular layouts of historical company records. The resulting data is published as linked data on an interactive SAMPO-UI platform, where researchers can query, browse, and visualize long-term corporate trajectories in Belgian business history.
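
As an illustration of how two text recognition outputs can be compared against a ground-truth transcription, the sketch below computes a character error rate (CER). The description does not say which metric the project uses, so CER is an assumption here; the function names and the sample strings are hypothetical and are not project data.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return levenshtein(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Invented strings for illustration only (assumption, not project data).
    ground_truth = "Societe anonyme, Bruxelles"
    outputs = {
        "ocr": "Societe anonvme, Bruxe1les",
        "vlm": "Societe anonyme, Bruxelles",
    }
    for system, text in outputs.items():
        print(system, round(cer(ground_truth, text), 3))
```

A lower CER means the output is closer to the transcription. A real evaluation would also need to account for reading order and layout, which a plain character-level score does not capture.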

Files

document.pdf (507.1 kB)
md5:4205d257d995d7da2c76fa3e994380cf