Published February 26, 2025
| Version v1
Conference paper
Open
National Library as Corpus: Introducing DeLiKo@DNB – a Large Synchronous German Fiction Corpus
Creators
- 1. Leibniz-Institut für Deutsche Sprache, Germany
- 2. Deutsche Nationalbibliothek, Germany
Contributors
Data managers:
- 1. Universität zu Köln
- 2. Universität Passau
- 3. Universität Bielefeld
- 4. Digital Humanities im deutschsprachigen Raum
Description
This paper introduces DeLiKo@DNB, a large, linguistically annotated, and large, freely accessible contemporary corpus of German fiction. The corpus currently comprises 2 billion words from over 26,000 books published between 2005 and the present, spanning pulp and genre fiction as well as literary award-winning works. We provide a detailed account of the corpus composition, metadata, and key features. Additionally, we outline our approach to ensuring lawful and productive access by deploying an instance of the open-source corpus analysis platform KorAP within the German National Library.
Files
KUPIETZ_Marc_National_Library_as_Corpus__Introducing_DeLiKo_.pdf
Files
(279.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:b656f0816e9f78f100c0b7858dca0560
|
255.5 kB | Preview Download |
|
md5:91a1d50ca28403b7ae7f6f57b4a25f85
|
24.0 kB | Preview Download |
Additional details
Related works
- Is part of
- Book: 10.5281/zenodo.14887460 (DOI)