childPoeDE: A corpus of German Children's Poems for Computational and Experimental Studies - Metadata
- 1. Johannes Gutenberg-Universität Mainz
- 2. Universität Basel
- 3. Freie Universität Berlin
Description
The childPoeDE corpus is a collection of 1082 German poems for children created within the CHYLSA project. The poems were taken from anthologies published between 1991 and 2019. This publication includes the poem-level metadata for each poem with information about the author, the poem's length, data on case, punctuation, layout, rhyme, type-token ratio (TTR and MATTR) and lexical density. It also includes token-level metadata, namely word length and position, POS tags in different levels of granularity as well as data on onomatopoeia and sonority. Furthermore, this publication provides a word frequency table and a Python script which was used to extract some of the metadata from the texts (poemtool.py). The childPoeDE corpus does not contain all poems from the anthologies. A list of the poems that have been omitted for different reasons (length, language, typography, ...) can be accessed as well.
Read more about the childPoeDE corpus in our data paper published in the Journal of Open Humanities Data: The ChildPoeDE Corpus: 1082 German Children’s Poems for Computational and Experimental Studies on Poetry Reception.
DFG Schwerpunktprogramm SPP 2207 “Computational Literary Studies“
Online:
Subproject: „CHYLSA (Children’s and Youth Literature Sentiment Analysis)“
Online:
Files
childPoeDE_poem_omissions.csv
Files
(27.5 MB)
Name | Size | Download all |
---|---|---|
md5:45136ef6eb76178728ae493bd3119743
|
9.4 kB | Preview Download |
md5:78ebf9bc44e750b5614423ba0cc54b46
|
2.6 MB | Preview Download |
md5:025069a2d2614848c3583f0125cee83d
|
285.5 kB | Preview Download |
md5:574f320dfe9c62ca4ed466488d5480e9
|
22.8 MB | Preview Download |
md5:fe852e41947364c1c37eff6638c20571
|
143.3 kB | Preview Download |
md5:f30af76d3ce5da99070bbddbed32b065
|
1.6 MB | Preview Download |
md5:6d4e957064be855be1a61e156d742b3b
|
21.5 kB | Download |
md5:a735e46bb27e5f5ebbcda5a1ec59433d
|
4.4 kB | Preview Download |
md5:442e9887365e65be69adef9be038c25c
|
4.3 kB | Preview Download |
md5:e88de71660734c92766ab8793cd60795
|
1.8 kB | Preview Download |
md5:58c357a0f2016a0192fe884156fc003e
|
973 Bytes | Preview Download |