Published February 26, 2025 | Version v1
Conference paper Open

National Library as Corpus: Introducing DeLiKo@DNB – a Large Synchronous German Fiction Corpus

  • 1. Leibniz-Institut für Deutsche Sprache, Germany
  • 2. Deutsche Nationalbibliothek, Germany
  • 1. Universität zu Köln
  • 2. Universität Passau
  • 3. Universität Bielefeld
  • 4. Digital Humanities im deutschsprachigen Raum

Description

This paper introduces DeLiKo@DNB, a large, linguistically annotated, and large, freely accessible contemporary corpus of German fiction. The corpus currently comprises 2 billion words from over 26,000 books published between 2005 and the present, spanning pulp and genre fiction as well as literary award-winning works. We provide a detailed account of the corpus composition, metadata, and key features. Additionally, we outline our approach to ensuring lawful and productive access by deploying an instance of the open-source corpus analysis platform KorAP within the German National Library.

Files

KUPIETZ_Marc_National_Library_as_Corpus__Introducing_DeLiKo_.pdf

Additional details

Related works

Is part of
Book: 10.5281/zenodo.14887460 (DOI)