Published March 10, 2023 | Version v1
Conference paper Open

Onboard onto DraCor. Prototyping Workflows to Homogenize Drama Corpora for an Open Infrastructure

  • 1. Universität Potsdam, Deutschland
  • 2. Freie Universität Berlin, Deutschland
  • 3. University of Oxford, Vereinigtes Königreich
  • 1. Universität Potsdam, Deutschland
  • 2. Digital Humanities im deutschsprachigen Raum e.V., Deutschland
  • 3. University of Luxembourg
  • 4. Universität Trier, Deutschland

Description

The process of onboarding new texts onto already established platforms, such as the Drama Corpora (DraCor) ecosystem, poses several challenges in terms of data curation and homogenization. We present here for discussion the prototypes of some pipelines, workflows, and tools embedding plays from diverse sources and formats into the DraCor environment. As a showcase of our approach, we also report on the building process of two new corpora (the English-language EPDraCor and the Ukrainian UDraCor), whose different sources require a flexible and tailored approach. Ein Beitrag zur 9. Tagung des Verbands "Digital Humanities im deutschsprachigen Raum" - DHd 2023 Open Humanities Open Culture.

Files

GIOVANNINI_Luca_Onboard_onto_DraCor.pdf

Files (76.6 kB)

Name Size Download all
md5:b6874de68b4f84b9f0cbf0a0a18addb5
60.4 kB Preview Download
md5:3587b07bffc771c93f9b71065b80e2f1
16.2 kB Preview Download

Additional details

Related works

Is part of
Book: 10.5281/zenodo.7688632 (DOI)
Is supplemented by
Poster: 10.5281/zenodo.7711513 (DOI)