Software Open Access

Sources and Perl scripts for the Poole-Camps-Cafiero Stemmatological Algorithm

Camps, Jean-Baptiste

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Camps, Jean-Baptiste</dc:creator>
  <dc:description>This repository contains the original Perl scripts and datasets, that were
used in the paper:

- Jean-Baptiste Camps &amp; Florian Cafiero, ``Genealogical variant locations and simplified stemma: a test case'', in Analysis of Ancient and Medieval Texts and Manuscripts: Digital Approaches, ed. Tara Andrews &amp; Caroline Macé, Turnhout, 2015 (Lectio, 1), p. 69‑93.

The scripts and data are offered in the state in which they were used for the original
version of the paper. We do not advise use of these scripts, but encourage
using the actualised version of the software, provided in the form of a package
for the statistical software _R_, available on Github,

In this repository, you will find:

	the original version of the three scripts (root of the repository);
	a folder for each of the data-sets (only Parzival and Fournival were actually used in the paper), containing
		*.csv, the numeric encoded format, to use with the scripts;
		*.ods, the spreadsheet in which variants were labelled and selected.

The scripts implement a revised version of the algorithm that was invented by:

	Eric Poole, ``The Computer in Determining Stemmatic Relationships'', Computers and the Humanities, 8-4 (1974), p. 207‑16 ;
	Eric Poole, ``L’analyse stemmatique des textes documentaires'', in La pratique des ordinateurs dans la critique des textes, Paris: CNRS Éditions, 1979, p. 151‑61 ;

and then revised and extend by Camps &amp; Cafiero 2015.

The script `EliminationdePoole` is for identifying conflicting Variant locations,
while `AgregationdePoole` is to be used to group manuscripts, and `Reconstruction`
to reconstruct the virtual model of a given group of manuscripts.

The two datasets that were used for the paper, Parzival and Fournival, come from,

	Parzival: M. Spencer et al., `Phylogenetics of artificial manuscripts', Journal of theoretical biology, 227-4 (2004), p. 503–511 ;
	Fournival: Richard de Fournival, Li bestiaires d’amours di maistre Richart de Fornival e li response du bestiaire, ed. Cesare Segre, Naples, 1957.

The other datasets are partial and in an unfinished state, they have their source in:

	Heinrichi: Roos, Teemu, and Heikkilä, Tuomas, ``Evaluating methods for computer-assisted stemmatology using artificial benchmark data sets'', Literary and Linguistic Computing, 24-4 (2009), p. 417–433.
	Notre-Besoin: Baret, Philippe V., Robinson, P., and Macé, C., ``Testing methods on an artificially created textual tradition'', Linguistica computazionale, 24 (2004), p. 1000–1029.

The fulls sources for most of these datasets can be obtained through the
site of the 2007 CASC:

	Roos, Teemu, Heikkilä, Tuomas, and Myllymäki, Petri, Computer-Assisted Stemmatology Challenge, 2007,
  <dc:subject>textual criticism</dc:subject>
  <dc:subject>romance philology</dc:subject>
  <dc:title>Sources and Perl scripts for the Poole-Camps-Cafiero Stemmatological Algorithm</dc:title>
All versions This version
Views 3232
Downloads 1414
Data volume 1.6 MB1.6 MB
Unique views 3232
Unique downloads 44


Cite as