Software Open Access

redewiedergabe/corpus: First public release of the "Redewiedergabe" corpus

redewiedergabe; NDTanja; Lukas Weimer

This beta release contains the first subset of the "Redewiedergabe" corpus. It includes 619 text samples and 360,974 tokens. 9,451 STWR instances have been annotated, as well as additional information like frames, introductory expressions and speakers. Available formats are TEI compatible XML and a column-based text format.

Files (5.8 MB)
Name Size
redewiedergabe/corpus-v0.1.0-beta.zip
md5:c0662a1dd25c0919a3cc455f352f4dfb
5.8 MB Download
12
2
views
downloads
All versions This version
Views 1212
Downloads 22
Data volume 11.5 MB11.5 MB
Unique views 88
Unique downloads 22

Share

Cite as