Published January 1, 2015 | Version v1
Software Open

Sources and Perl scripts for the Poole-Camps-Cafiero Stemmatological Algorithm

  • 1. École nationale des chartes

Description

This repository contains the original Perl scripts and datasets that were
used in the paper:

- Jean-Baptiste Camps & Florian Cafiero, ``Genealogical variant locations and simplified stemma: a test case'', in Analysis of Ancient and Medieval Texts and Manuscripts: Digital Approaches, ed. Tara Andrews & Caroline Macé, Turnhout, 2015 (Lectio, 1), p. 69‑93.

The scripts and data are offered in the state in which they were used for the original
version of the paper. We do not advise using these scripts; instead, we encourage
using the updated version of the software, provided as a package
for the statistical software _R_, available on GitHub,
http://github.com/Jean-Baptiste-Camps/stemmatology.

In this repository, you will find:

  • the original version of the three scripts (root of the repository);
  • a folder for each of the datasets (only Parzival and Fournival were actually used in the paper), containing
    • *.csv, the numeric encoded format, to use with the scripts;
    • *.ods, the spreadsheet in which variants were labelled and selected.

The scripts implement a revised version of the algorithm that was invented by:

  • Eric Poole, ``The Computer in Determining Stemmatic Relationships'', Computers and the Humanities, 8-4 (1974), p. 207‑16 ;
  • Eric Poole, ``L’analyse stemmatique des textes documentaires'', in La pratique des ordinateurs dans la critique des textes, Paris: CNRS Éditions, 1979, p. 151‑61 ;

and then revised and extended by Camps & Cafiero 2015.

The script `EliminationdePoole` identifies conflicting variant locations,
`AgregationdePoole` groups manuscripts, and `Reconstruction`
reconstructs the virtual model of a given group of manuscripts.
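
To illustrate what "conflicting variant locations" can mean, here is a minimal sketch in Python (not the Perl of the repository, and not a port of `EliminationdePoole`). It rests on assumptions that this description does not confirm: each variant location is represented as a mapping from a manuscript siglum to a numeric reading code, `None` marks a missing reading, and two variant locations are treated as conflicting when two readings of one and two readings of the other occur in all four combinations. The function names and the toy table are hypothetical.

```python
from itertools import combinations


def readings_conflict(vl_a, vl_b):
    """Return True if two variant locations conflict (assumed criterion).

    vl_a, vl_b: dicts mapping a manuscript siglum to a numeric reading
    code, with None for a missing reading (assumed encoding).
    """
    # Only manuscripts with a reading at both locations are compared.
    shared = [ms for ms in vl_a
              if vl_a[ms] is not None and vl_b.get(ms) is not None]
    pairs = {(vl_a[ms], vl_b[ms]) for ms in shared}
    readings_a = sorted({a for a, _ in pairs})
    readings_b = sorted({b for _, b in pairs})
    # Conflict: some 2 x 2 choice of readings is attested in all four
    # combinations, so the two groupings cannot fit the same tree.
    for a1, a2 in combinations(readings_a, 2):
        for b1, b2 in combinations(readings_b, 2):
            if {(a1, b1), (a1, b2), (a2, b1), (a2, b2)} <= pairs:
                return True
    return False


def find_conflicts(table):
    """List every pair of conflicting variant locations in a table.

    table: dict mapping a variant-location label to its readings dict.
    """
    return [(x, y) for x, y in combinations(table, 2)
            if readings_conflict(table[x], table[y])]


# Hypothetical toy data: four manuscripts, three variant locations.
toy_table = {
    "VL1": {"A": 1, "B": 1, "C": 2, "D": 2},
    "VL2": {"A": 1, "B": 1, "C": 2, "D": 2},
    "VL3": {"A": 1, "B": 2, "C": 1, "D": 2},
}
conflicts = find_conflicts(toy_table)
```

In this toy table, VL1 and VL2 group the manuscripts identically (AB against CD), whereas VL3 groups them as AC against BD, so VL3 conflicts with both of the others.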

The two datasets that were used for the paper, Parzival and Fournival, come from:

  • Parzival: M. Spencer et al., ``Phylogenetics of artificial manuscripts'', Journal of theoretical biology, 227-4 (2004), p. 503–511 ;
  • Fournival: Richard de Fournival, Li bestiaires d’amours di maistre Richart de Fornival e li response du bestiaire, ed. Cesare Segre, Naples, 1957.

The other datasets are partial and unfinished; their sources are:

  • Heinrichi: Roos, Teemu, and Heikkilä, Tuomas, ``Evaluating methods for computer-assisted stemmatology using artificial benchmark data sets'', Literary and Linguistic Computing, 24-4 (2009), p. 417–433.
  • Notre-Besoin: Baret, Philippe V., Robinson, P., and Macé, C., ``Testing methods on an artificially created textual tradition'', Linguistica computazionale, 24 (2004), p. 1000–1029.

The full sources for most of these datasets can be obtained through the
site of the 2007 CASC:

  • Roos, Teemu, Heikkilä, Tuomas, and Myllymäki, Petri, Computer-Assisted Stemmatology Challenge, 2007, https://www.cs.helsinki.fi/u/ttonteri/casc/data.html.

Files

Total size: 765.4 kB. Checksum and size of each file:

  • md5:6996d389815496d4f20aeb446765c276 (4.9 kB)
  • md5:d60b092131c450bea586d42b94ab9479 (6.1 kB)
  • md5:d887e4a59fa050f11c423aad52a99b46 (32.3 kB)
  • md5:4f1058a59d8e749e60fa6c59a08866dc (536.1 kB)
  • md5:c59d4fb6f3dff0ae77bc5f633e4b7dd8 (83.5 kB)
  • md5:91fa83a9d6e74fc1494fc753f5d02df6 (92.7 kB)
  • md5:2f0e0b56910c3daa9c3f1ea8a76da104 (10.0 kB)

Additional details

References

  • Jean-Baptiste Camps & Florian Cafiero, ``Genealogical variant locations and simplified stemma: a test case'', in _Analysis of Ancient and Medieval Texts and Manuscripts: Digital Approaches_, ed. Tara Andrews & Caroline Macé, Turnhout, 2015 (Lectio, 1), p. 69‑93.