There is a newer version of the record available.

Published June 22, 2020 | Version 1.00
Dataset Open

Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts

  • 1. Scuola Normale Superiore
  • 2. Consiglio Nazionale delle Ricerche
  • 3. Università di Pisa

Description

We make available MedLatin1 and MedLatin2, two datasets of medieval Latin texts to be used in research on computational authorship analysis. MedLatin1 and MedLatin2 consist of 294 and 30 curated texts, respectively, labelled by author, with MedLatin1 texts being of an epistolary nature and MedLatin2 texts consisting of literary comments and treatises about various subjects. As such, these two datasets lend themselves to supporting research in authorship analysis tasks, such as authorship attribution, authorship verification, or same-author verification.

Notes

See also "MedieValla: An authorship verification tool written in Python for medieval Latin", https://zenodo.org/record/3903236

Files

MedLatin.zip

Files (3.5 MB)

Name Size Download all
md5:8f1ede8cbe2b20fb84ca8b31c2e536fe
3.5 MB Preview Download