Presentation Open Access

How we tripled our encoding speed in the Digital Victorian Periodical Poetry project

Holmes, Martin; Fralick, Kaitlyn; Fukushima, Kailey; Karlson, Sarah


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>The Digital Victorian Periodical Poetry (DVPP) project is a SSHRC-funded digital humanities<br>\nproject based at the University of Victoria. With the guidance of principal investigator Dr. Alison<br>\nChapman, the DVPP team is creating a digital index of British periodical poetry from the long<br>\nnineteenth century. In addition to uncovering periodical poems, writing descriptive metadata, and<br>\ncompiling prosopographical research, we are currently using TEI and CSS to encode a statistically-<br>\nrepresentative sample of indexed poems, looking for quantitative evidence of literary change over<br>\ntime. Such an endeavour requires a large, robust dataset covering a range of periodicals throughout<br>\nthe period.<br>\nAt the time of writing, there are more than 13,000 poems in the database, and we expect that total<br>\nto reach 20,000. Of these, around 2,000 will be encoded, focusing on the decade years (1820, 1830,<br>\n1840, and so on).<br>\nJournal of the Text Encoding Initiative,<br>\n1How we tripled our encoding speed in the Digital Victorian Periodical Project<br>\nIn this presentation, we will showcase the various strategies and tools we have used to speed up<br>\nour encoding process. We combine simple tricks like keyboard shortcuts with more sophisticated<br>\nprocesses to minimize drudgery and increase accuracy. Among the more interesting techniques<br>\nare:<br>\n&bull; Auto-tagging of a complete poem in lines and linegroups using a Schematron QuickFix;<br>\n&bull; Use of advanced CSS selectors in the rendition/@selector attribute to reduce encoding<br>\nclutter in the poem itself;<br>\n&bull;<br>\nA keyboard shortcut to tag rhymes which detects whether the tagged text is a masculine<br>\nor feminine rhyme and provides the appropriate attribute value;<br>\n&bull;<br>\nAuto-detection of cases where a new line-end rhymes with a previously-encoded rhyme,<br>\nand should, therefore, be labelled to match it, leveraging our growing dataset of nearly<br>\n30,000 rhymes;<br>\n&bull;<br>\nInstant access to to a rendering of the poem which provides a visualization of the rhyme<br>\nstructure, auto-detection of anaphora, epistrophe and other refrain-like forms, and other<br>\ndiagnostic feedback.</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Victoria HCMC", 
      "@id": "https://orcid.org/0000-0002-3944-1116", 
      "@type": "Person", 
      "name": "Holmes, Martin"
    }, 
    {
      "affiliation": "University of Victoria", 
      "@type": "Person", 
      "name": "Fralick, Kaitlyn"
    }, 
    {
      "affiliation": "University of Victoria", 
      "@type": "Person", 
      "name": "Fukushima, Kailey"
    }, 
    {
      "affiliation": "University of Victoria", 
      "@type": "Person", 
      "name": "Karlson, Sarah"
    }
  ], 
  "url": "https://zenodo.org/record/3449241", 
  "datePublished": "2019-09-19", 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.5281/zenodo.3449241", 
  "@id": "https://doi.org/10.5281/zenodo.3449241", 
  "@type": "PresentationDigitalDocument", 
  "name": "How we tripled our encoding speed in the Digital Victorian Periodical Poetry project"
}
539
66
views
downloads
All versions This version
Views 539539
Downloads 6666
Data volume 147.6 MB147.6 MB
Unique views 524524
Unique downloads 6464

Share

Cite as