Published April 20, 2026 | Version v1
Conference paper Open

Sexual Abuse in the Catholic Church: Annotation and Machine Learning of Hidden Patterns in Abuse Reports

  • 1. Chair of Pastoral Theology and Homiletics, University of Regensburg, Germany
  • 2. Media Informatics Group, University of Regensburg, Germany

Description

This paper presents a computational-theological analysis of sexual abuse reports published by Catholic dioceses in German-speaking regions, with a focus on the largely overlooked experiences of women survivors. We compiled a corpus of five representative reports and developed a hierarchical annotation scheme to capture discursive patterns related to victims, perpetrators, institutional responses, and descriptions of abuse. Quantitative analyses reveal a notable number of female victims and a consistent dominance of institutional voices, while survivor perspectives appear far less frequently. Descriptions of abuse often rely on unspecific terms, reflecting discursive tendencies that obscure the nature of violence and reinforce epistemic injustice. Using sentence-level binary classification with transformer-based models, we achieved promising results (balanced accuracy ~0.8) despite class imbalance. These models enable scalable detection of text segments concerning women. The study demonstrates how DH methods can uncover hidden narrative structures in church documents and support critical theological inquiry.

Files

Hürten_DHBenelux_2026 (1).pdf

Files (516.0 kB)

Name Size Download all
md5:54b81c2ca95c037bb094ae70b0c7b60a
516.0 kB Preview Download

Additional details