Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published May 15, 2023 | Version 2.0
Dataset Open

childPoeDE: A corpus of German Children's Poems for Computational and Experimental Studies - Metadata

  • 1. Johannes Gutenberg-Universität Mainz
  • 2. Universität Basel
  • 3. Freie Universität Berlin

Description

The childPoeDE corpus is a collection of 1082 German poems for children created within the CHYLSA project. The poems were taken from anthologies published between 1991 and 2019. This publication includes the poem-level metadata for each poem with information about the author, the poem's length, data on case, punctuation, layout, rhyme, type-token ratio (TTR and MATTR) and lexical density. It also includes token-level metadata, namely word length and position, POS tags in different levels of granularity as well as data on onomatopoeia and sonority. Furthermore, this publication provides a word frequency table and a Python script which was used to extract some of the metadata from the texts (poemtool.py). The childPoeDE corpus does not contain all poems from the anthologies. A list of the poems that have been omitted for different reasons (length, language, typography, ...) can be accessed as well.

Read more about the childPoeDE corpus in our data paper published in the Journal of Open Humanities Data: The ChildPoeDE Corpus: 1082 German Children’s Poems for Computational and Experimental Studies on Poetry Reception.

DFG Schwerpunktprogramm SPP 2207 “Computational Literary Studies“
Online:

  1. https://gepris.dfg.de/gepris/projekt/402743989
  2. https://dfg-spp-cls.github.io/

Subproject: „CHYLSA (Children’s and Youth Literature Sentiment Analysis)“

Online:

  1. https://gepris.dfg.de/gepris/projekt/424250469
  2. https://dfg-spp-cls.github.io/projects_en/2020/01/24/TP-CHYLSA/

Files

childPoeDE_poem_omissions.csv

Files (27.5 MB)

Name Size Download all
md5:45136ef6eb76178728ae493bd3119743
9.4 kB Preview Download
md5:78ebf9bc44e750b5614423ba0cc54b46
2.6 MB Preview Download
md5:025069a2d2614848c3583f0125cee83d
285.5 kB Preview Download
md5:574f320dfe9c62ca4ed466488d5480e9
22.8 MB Preview Download
md5:fe852e41947364c1c37eff6638c20571
143.3 kB Preview Download
md5:f30af76d3ce5da99070bbddbed32b065
1.6 MB Preview Download
md5:6d4e957064be855be1a61e156d742b3b
21.5 kB Download
md5:a735e46bb27e5f5ebbcda5a1ec59433d
4.4 kB Preview Download
md5:442e9887365e65be69adef9be038c25c
4.3 kB Preview Download
md5:e88de71660734c92766ab8793cd60795
1.8 kB Preview Download
md5:58c357a0f2016a0192fe884156fc003e
973 Bytes Preview Download