Published August 6, 2022 | Version v2
Preprint Open

Model the source first! Towards Computer-Assisted Semantic Text Modelling and source criticism 2.0

  • 1. Masaryk University, Centre for the Digital Research of Religion


This article presents a proposal for data collection from textual resources in history and the social sciences that we call Computer-Assisted Semantic Text Modelling (CASTEMO). The CASTEMO data model and data collection workflow is based on detailed, yet flexible semantic encoding of the original natural-language syntactic structure and wording: translating texts line by line into structured data while preserving all of their vagaries, complexities, conflicting testimonies and the like. We outline a thorough way of modelling the sources in order to make them accessible to all manner of quantitative and computational analyses.


Model the source first - v. 2.pdf

Files (1.1 MB)

Name Size Download all
1.1 MB Preview Download

Additional details


DISSINET – Networks of Dissent: Computational Modelling of Dissident and Inquisitorial Cultures in Medieval Europe 101000442
European Commission