Sexual Abuse in the Catholic Church: Annotation and Machine Learning of Hidden Patterns in Abuse Reports
Authors/Creators
- 1. Chair of Pastoral Theology and Homiletics, University of Regensburg, Germany
- 2. Media Informatics Group, University of Regensburg, Germany
Description
This paper presents a computational-theological analysis of sexual abuse reports published by Catholic dioceses in German-speaking regions, with a focus on the largely overlooked experiences of women survivors. We compiled a corpus of five representative reports and developed a hierarchical annotation scheme to capture discursive patterns related to victims, perpetrators, institutional responses, and descriptions of abuse. Quantitative analyses reveal a notable number of female victims and a consistent dominance of institutional voices, while survivor perspectives appear far less frequently. Descriptions of abuse often rely on unspecific terms, reflecting discursive tendencies that obscure the nature of violence and reinforce epistemic injustice. Using sentence-level binary classification with transformer-based models, we achieved promising results (balanced accuracy ~0.8) despite class imbalance. These models enable scalable detection of text segments concerning women. The study demonstrates how DH methods can uncover hidden narrative structures in church documents and support critical theological inquiry.
Files
Hürten_DHBenelux_2026 (1).pdf
Files
(516.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:54b81c2ca95c037bb094ae70b0c7b60a
|
516.0 kB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/lauchblatt/Epi_Epa