Onboard onto DraCor. Prototyping Workflows to Homogenize Drama Corpora for an Open Infrastructure
Creators
- 1. Universität Potsdam, Deutschland
- 2. Freie Universität Berlin, Deutschland
- 3. University of Oxford, Vereinigtes Königreich
Contributors
Editors:
Project members:
- 1. Universität Potsdam, Deutschland
- 2. Digital Humanities im deutschsprachigen Raum e.V., Deutschland
- 3. University of Luxembourg
- 4. Universität Trier, Deutschland
Description
The process of onboarding new texts onto already established platforms, such as the Drama Corpora (DraCor) ecosystem, poses several challenges in terms of data curation and homogenization. We present here for discussion the prototypes of some pipelines, workflows, and tools embedding plays from diverse sources and formats into the DraCor environment. As a showcase of our approach, we also report on the building process of two new corpora (the English-language EPDraCor and the Ukrainian UDraCor), whose different sources require a flexible and tailored approach. Ein Beitrag zur 9. Tagung des Verbands "Digital Humanities im deutschsprachigen Raum" - DHd 2023 Open Humanities Open Culture.
Files
GIOVANNINI_Luca_Onboard_onto_DraCor.pdf
Files
(76.6 kB)
Name | Size | Download all |
---|---|---|
md5:b6874de68b4f84b9f0cbf0a0a18addb5
|
60.4 kB | Preview Download |
md5:3587b07bffc771c93f9b71065b80e2f1
|
16.2 kB | Preview Download |
Additional details
Related works
- Is part of
- Book: 10.5281/zenodo.7688632 (DOI)
- Is supplemented by
- Poster: 10.5281/zenodo.7711513 (DOI)