Preprint Open Access

Extracting Citation Networks from Publications in Classics

Matteo, Romanello

Referring constitutes such an essential scholarly activity across disciplines that it has been regarded by Unsworth (2000) as one of the scholarly primitives. In Classics, in particular, the references to passages of the ancient texts—the so-called canonical citations (or references)—play a prominent role. The potential of these citations, however, has not been fully exploited to date, despite the attention that they have recently received in the field of Digital Humanities.

In this paper I discuss two aspects of making such citations computable. Firstly, I illustrate how they can be extracted from text by using Natural Language Processing techniques, especially Named Entity Recognition. Secondly, I discuss the creation of a three-level citation network to formalise the web of relations between texts that canonical references implicitly constitute. As I outline in the conclusive section of this paper, the possible uses of the extracted citation network include the development of search applications and recommender systems for bibliography; the enhancement of digital environments to read primary sources with links to related secondary literature and the application of these network to the study of intertextuality and text reception. 

This paper was published in a special issue of the Digital Humanities Quarterly journal edited by Neil Bernstein and Neil Coffee, see .
Files (6.3 MB)
Name Size
6.3 MB Download
All versions This version
Views 400400
Downloads 147146
Data volume 929.8 MB923.5 MB
Unique views 367367
Unique downloads 133132


Cite as