Published May 2, 2018 | Version 1.0
Dataset Open

Character mentions in the German novel "Corpus Delicti" by Juli Zeh and annotations

  • 1. Universität Hamburg
  • 2. Technische Universität Hamburg

Description

This file contains all character mentions in the German novel "Corpus Delicti" by Juli Zeh. The annotation was conducted by members of the research group hermA (www.herma.uni-hamburg.de). The file includes the following columns:

  • id
  • token: tokens of the character mention
  • entity_nr: the (arbitrary) entity number. All mentions of one character share the same entity number.
  • token_nr: position of the mention in the text in tokens
  • sentence_nr: position of the mention in the text in sentences
  • chapter: number of the chapter the mention occurs in (49 chapters in total)
  • direct_speech: True if the mention occurs between quotation marks
  • entity_name: a mapping of the entity number to the most frequent proper name used for the entity (if available)
  • form: grammatical form of the mention, derived from an automatic part-of-speech tagging (NE = proper name; NP = noun phrase; PPER = personal pronoun; PPOSAT = possessive pronoun)

 

 

Files

mentions_corpus_delicti.txt

Files (295.1 kB)

Name Size Download all
md5:9fb6b8ee1f7331fe88dfc319882ecd2b
295.1 kB Preview Download