HeliPaD: the Heliand Parsed Database
Description
This corpus contains all 5,968 lines of the C manuscript of the Old Saxon Heliand, a gospel harmony written in alliterative verse, using the Sievers (1878) edition. Compared to the standard Behaghel critical edition, this one has the advantages for linguistic research that a) it does not conflate the different forms found in different manuscripts, b) it is not as heavily emended, and c) it is now in the public domain.
The corpus is a UTF-8 plain text file designed to be searched using the program CorpusSearch 2, with the standard extension .psd, broadly following the format of the Penn Corpora of Historical English and related projects (IcePaHC, Early New High German Parsed Corpus, MCVF). It is annotated on a number of levels:
- Textual and metrical (page in manuscript, page in edition, line number, caesura)
- Lemmatization
- Parts of speech and morphology
- Syntactic parsing
The total size of the corpus is 46,067 words (not including punctuation and code).
Notes
Files
HeliPaD-manual.pdf
Files
(4.5 MB)
Name | Size | Download all |
---|---|---|
md5:ea110d34d38b396bfef0d96f2f7e9016
|
3.5 MB | Download |
md5:78c58213d31abfb44cf1a0c0612afac8
|
952.0 kB | Preview Download |
Additional details
Related works
- Is documented by
- Journal article: 10.1075/ijcl.21.4.05wal (DOI)