Published May 7, 2021 | Version preprint
Journal article Open

Creating the European Literary Text Collection (ELTeC): Challenges and Perspectives

  • 1. Trier University, Germany
  • 2. Jožef Stefan Institute, Ljubljana, Slovenia
  • 3. Universitatea Alexandru Ioan Cuza, Romania
  • 4. University of Oslo, Norway

Description

Please refer to the version of record, published in open access and available here: http://doi.org/10.3828/mlo.v0i0.364

The aim of this contribution is to reflect on the process of building the multilingual European Literary Text Collection (ELTeC) that is being created in the framework of the COST Action on Distant Reading for European Literary History. To provide some background, we briefly introduce the basic idea of the ELTeC with a focus on the overall goals and the intended usage scenarios. We then describe the collection composition principles we have derived from the usage scenarios. In our discussion of the corpus building process, we focus on collections of novels from four different literary traditions as components of ELTeC: French, Portuguese, Romanian, and Slovenian, selected from 18 collections that are currently in preparation. For each collection, we describe some of the challenges we have encountered and the solutions we have developed while building ELTeC. In each case, the literary tradition, the history of the language, the current state of digitization of cultural heritage, the resources available locally, and the scholars' training level with regard to digitization and corpus building have been vastly different. How can we, in this context, hope to build comparable collections of novels that can usefully be integrated into a multilingual resource such as ELTeC and used in Distant Reading research? Based on our respective and collective experience with contributing to ELTeC, we end this contribution with some lessons learned regarding collaborative, multilingual corpus building.

The preprint made available here has been accepted at Modern Languages Open, https://www.modernlanguagesopen.org/. The version currently available here is the revised manuscript as submitted by the authors.
