Published May 11, 2021 | Version v1
Presentation Open

Historical Newspaper Content Mining: findings from the impresso project

  • 1. École Polytechnique Fédérale de Lausanne, Digital Humanities Laboratory
  • 2. Leibniz Centre for Contemporary History Potsdam

Description

These are the slides from the 2021 Workshop ‘Historical Newspaper Content Mining: findings from the impresso project’, which was part of the series ‘Applying and deploying Artificial Intelligence (AI) in GLAMs’ organised by AI4LAM (Teaching and Learning Working Group) and co-hosted by LIBER and the BnF.

Extracting content via text mining and making it accessible for scholarly research has been often discussed in the past decade, but the noisy output has stiffened its realisation. ‘Media Monitoring of the Past’ is an interdisciplinary research project in which a team of computational linguists, designers and historians seek to integrate text mining in historical research workflows. The project uses the datafication of a multilingual corpus of digitized historical newspapers from various transnational European collections. The findings of this project have been used for the creation of the impresso app, which allows researchers to explore, use, and share the historical texts from the project’s corpus.

The aim of the workshop series is to provide training opportunities for those interested in applying and deploying Artificial Intelligence (AI) in Libraries, Galleries, Archives, and Museums. The series will bring together a diverse community of experts with subject and domain expertise, as well as technologists across GLAM institutions for a collaborative learning event to share tools and experiences and to reflect on the process of applying AI and its implications for GLAM institutions.

Files

Historical Newspaper Content Mining. Findings from the impresso project.pdf

Files (201.6 MB)