Presentation Open Access

How we tripled our encoding speed in the Digital Victorian Periodical Poetry project

Holmes, Martin; Fralick, Kaitlyn; Fukushima, Kailey; Karlson, Sarah


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/f337e9d6-29b9-4de8-b14a-539e158e54dc/encoding_speed.pdf"
      }, 
      "checksum": "md5:444e60f0b72bf7ffcdb58bf0d1d7adc5", 
      "bucket": "f337e9d6-29b9-4de8-b14a-539e158e54dc", 
      "key": "encoding_speed.pdf", 
      "type": "pdf", 
      "size": 2236806
    }
  ], 
  "owners": [
    77239
  ], 
  "doi": "10.5281/zenodo.3449241", 
  "stats": {
    "version_unique_downloads": 64.0, 
    "unique_views": 524.0, 
    "views": 539.0, 
    "version_views": 539.0, 
    "unique_downloads": 64.0, 
    "version_unique_views": 524.0, 
    "volume": 147629196.0, 
    "version_downloads": 66.0, 
    "downloads": 66.0, 
    "version_volume": 147629196.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.3449241", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.3449240", 
    "bucket": "https://zenodo.org/api/files/f337e9d6-29b9-4de8-b14a-539e158e54dc", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.3449240.svg", 
    "html": "https://zenodo.org/record/3449241", 
    "latest_html": "https://zenodo.org/record/3449241", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.3449241.svg", 
    "latest": "https://zenodo.org/api/records/3449241"
  }, 
  "conceptdoi": "10.5281/zenodo.3449240", 
  "created": "2019-09-19T13:26:55.398731+00:00", 
  "updated": "2020-01-20T17:20:17.882425+00:00", 
  "conceptrecid": "3449240", 
  "revision": 5, 
  "id": 3449241, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.3449241", 
    "description": "<p>The Digital Victorian Periodical Poetry (DVPP) project is a SSHRC-funded digital humanities<br>\nproject based at the University of Victoria. With the guidance of principal investigator Dr. Alison<br>\nChapman, the DVPP team is creating a digital index of British periodical poetry from the long<br>\nnineteenth century. In addition to uncovering periodical poems, writing descriptive metadata, and<br>\ncompiling prosopographical research, we are currently using TEI and CSS to encode a statistically-<br>\nrepresentative sample of indexed poems, looking for quantitative evidence of literary change over<br>\ntime. Such an endeavour requires a large, robust dataset covering a range of periodicals throughout<br>\nthe period.<br>\nAt the time of writing, there are more than 13,000 poems in the database, and we expect that total<br>\nto reach 20,000. Of these, around 2,000 will be encoded, focusing on the decade years (1820, 1830,<br>\n1840, and so on).<br>\nJournal of the Text Encoding Initiative,<br>\n1How we tripled our encoding speed in the Digital Victorian Periodical Project<br>\nIn this presentation, we will showcase the various strategies and tools we have used to speed up<br>\nour encoding process. We combine simple tricks like keyboard shortcuts with more sophisticated<br>\nprocesses to minimize drudgery and increase accuracy. Among the more interesting techniques<br>\nare:<br>\n&bull; Auto-tagging of a complete poem in lines and linegroups using a Schematron QuickFix;<br>\n&bull; Use of advanced CSS selectors in the rendition/@selector attribute to reduce encoding<br>\nclutter in the poem itself;<br>\n&bull;<br>\nA keyboard shortcut to tag rhymes which detects whether the tagged text is a masculine<br>\nor feminine rhyme and provides the appropriate attribute value;<br>\n&bull;<br>\nAuto-detection of cases where a new line-end rhymes with a previously-encoded rhyme,<br>\nand should, therefore, be labelled to match it, leveraging our growing dataset of nearly<br>\n30,000 rhymes;<br>\n&bull;<br>\nInstant access to to a rendering of the poem which provides a visualization of the rhyme<br>\nstructure, auto-detection of anaphora, epistrophe and other refrain-like forms, and other<br>\ndiagnostic feedback.</p>", 
    "language": "eng", 
    "title": "How we tripled our encoding speed in the Digital Victorian Periodical Poetry project", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3449240"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3449241"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "tei2019"
      }
    ], 
    "publication_date": "2019-09-19", 
    "creators": [
      {
        "orcid": "0000-0002-3944-1116", 
        "affiliation": "University of Victoria HCMC", 
        "name": "Holmes, Martin"
      }, 
      {
        "affiliation": "University of Victoria", 
        "name": "Fralick, Kaitlyn"
      }, 
      {
        "affiliation": "University of Victoria", 
        "name": "Fukushima, Kailey"
      }, 
      {
        "affiliation": "University of Victoria", 
        "name": "Karlson, Sarah"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "presentation", 
      "title": "Presentation"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.3449240", 
        "relation": "isVersionOf"
      }
    ]
  }
}
539
66
views
downloads
All versions This version
Views 539539
Downloads 6666
Data volume 147.6 MB147.6 MB
Unique views 524524
Unique downloads 6464

Share

Cite as