There is a newer version of the record available.

Published October 21, 2025 | Version v1
Dataset Open

Dataset of Manuscripts Signed and Attributed to Raoulet d'Orléans and Henri de Trévou

  • 1. ROR icon Institut de Recherche et d'Histoire des Textes
  • 2. ROR icon Laboratoire d'Informatique Gaspard-Monge

Description

This repository hosts the dataset used for the paper “Reconciling traditional and computational methods for the analysis of scribal hands; the case of Raoulet d’Orléans and Henri de Trévou (XIVe c.)”, presented at the 23rd Colloquium of Latin Palaeography (CIPL), 17-19 September 2025, Vienna (expected to be published in Brepols as part of the Colloque's proceedings, in 2026). 

Contents

The archive RaouletOrleans_HenriTrevou_dataset.zip contains the following structure:

├── images/
└── annotation.json

└── RaouletOrleans_HenriTrevou_dataset_metadata.csv

 

images/

Contains subfolders per manuscript ID, each including extracted .png lines (via eScriptorium) from selected folios. Polygonal line extractions include alpha transparency and are deslanted.

Image naming convention: DocID_f<number>

Image rights: 

  • Reproductions from open-access and public collections:

    • Paris, Bibliothèque nationale de France — Gallica

    • Vienna, Austrian National Library

    • Oxford, Saint John’s College Library: By permission of the President and Fellows of St John’s College Oxford.

    • Paris, Bibliothèque Sainte-Geneviève

    • Cambridge, Massachusetts Library

    • IRHT library — microfilm reproduction of London, BL - ADD 15420 (Thanks to Gilles Kagan for providing the file)

     

    Purchased or licensed reproductions:

    • KB (Det Kongelige Bibliotek, Denmark): KB Thott 6, f.1 and 228v; KB Thott 6, f.229r and 472v (purchased high-resolution images);

    • KBR (Royal Library of Belgium): KBR – Cabinet des Estampes et des Dessins – S.V10319 f.7r, 90r, 175r / KBR – Cabinet des Estampes et des Dessins – S.V9507 f.4r and 125r/ KBR – Cabinet des Estampes et des Dessins – S.V9505-6 f.1v and 222r /KBR – Cabinet des Estampes et des Dessins – S.V11201 (purchased high-resolution images)

     

    Special thanks:

    • Koninklijke Bibliotheek, Netherlands — with special thanks to Ed Van der Vlist for providing high-quality reproductions at no cost for Koninklijke Bibliotheek, KW 78 D 41.

     

annotations.json

A JSON file containing line-level transcriptions and metadata. The dataset is CATMuS-compliant and uses a graphemic transcription approach. 

Structure example:

"<image_id>": {
  "split": "train",
  "label": "A beautiful calico cat.",  // Transcription text of the line
  "script": "RaouletOrleans",         // Scribal hand identifier
  "folio": "1r",
  "doc": "RO1"
}

 

RaouletOrleans_HenriTrevou_dataset_metadata.csv

A CSV file accompanies the dataset with folio-level metadata, for example:

 

Shelfmark

Colophon

DocID

FileID

Split

Disposition

TextualCategory

TextType

Text

NotBefore

NotAfter

Scribe

NbPages

Folios

NbLines

Paris, BnF NAF, 27401, f.159-194v et 255-266v

No

RO1_1

btv1b10532600x_f321

val

columns

Narratives

Historiography

Pierre Bersuire, Histoire Romaine de Tite-Live

1358

1361

RaouletOrleans

1

159r

85

Files

RaouletOrleans_HenriTrevou_dataset.zip

Files (1.3 GB)

Name Size Download all
md5:4401f9a9187f147a4fb9623c7c727b71
1.3 GB Preview Download

Additional details

Funding

European Research Council
DISCOVER 101076028
Centre National de la Recherche Scientifique
CreMe