Presentation Open Access

How we tripled our encoding speed in the Digital Victorian Periodical Poetry project

Holmes, Martin; Fralick, Kaitlyn; Fukushima, Kailey; Karlson, Sarah


Citation Style Language JSON Export

{
  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.3449241", 
  "language": "eng", 
  "title": "How we tripled our encoding speed in the Digital Victorian Periodical Poetry project", 
  "issued": {
    "date-parts": [
      [
        2019, 
        9, 
        19
      ]
    ]
  }, 
  "abstract": "<p>The Digital Victorian Periodical Poetry (DVPP) project is a SSHRC-funded digital humanities<br>\nproject based at the University of Victoria. With the guidance of principal investigator Dr. Alison<br>\nChapman, the DVPP team is creating a digital index of British periodical poetry from the long<br>\nnineteenth century. In addition to uncovering periodical poems, writing descriptive metadata, and<br>\ncompiling prosopographical research, we are currently using TEI and CSS to encode a statistically-<br>\nrepresentative sample of indexed poems, looking for quantitative evidence of literary change over<br>\ntime. Such an endeavour requires a large, robust dataset covering a range of periodicals throughout<br>\nthe period.<br>\nAt the time of writing, there are more than 13,000 poems in the database, and we expect that total<br>\nto reach 20,000. Of these, around 2,000 will be encoded, focusing on the decade years (1820, 1830,<br>\n1840, and so on).<br>\nJournal of the Text Encoding Initiative,<br>\n1How we tripled our encoding speed in the Digital Victorian Periodical Project<br>\nIn this presentation, we will showcase the various strategies and tools we have used to speed up<br>\nour encoding process. We combine simple tricks like keyboard shortcuts with more sophisticated<br>\nprocesses to minimize drudgery and increase accuracy. Among the more interesting techniques<br>\nare:<br>\n&bull; Auto-tagging of a complete poem in lines and linegroups using a Schematron QuickFix;<br>\n&bull; Use of advanced CSS selectors in the rendition/@selector attribute to reduce encoding<br>\nclutter in the poem itself;<br>\n&bull;<br>\nA keyboard shortcut to tag rhymes which detects whether the tagged text is a masculine<br>\nor feminine rhyme and provides the appropriate attribute value;<br>\n&bull;<br>\nAuto-detection of cases where a new line-end rhymes with a previously-encoded rhyme,<br>\nand should, therefore, be labelled to match it, leveraging our growing dataset of nearly<br>\n30,000 rhymes;<br>\n&bull;<br>\nInstant access to to a rendering of the poem which provides a visualization of the rhyme<br>\nstructure, auto-detection of anaphora, epistrophe and other refrain-like forms, and other<br>\ndiagnostic feedback.</p>", 
  "author": [
    {
      "family": "Holmes, Martin"
    }, 
    {
      "family": "Fralick, Kaitlyn"
    }, 
    {
      "family": "Fukushima, Kailey"
    }, 
    {
      "family": "Karlson, Sarah"
    }
  ], 
  "type": "speech", 
  "id": "3449241"
}
515
48
views
downloads
All versions This version
Views 515515
Downloads 4848
Data volume 107.4 MB107.4 MB
Unique views 502502
Unique downloads 4646

Share

Cite as