Published February 5, 2021
| Version 1.1
Other
Restricted
Corpus for Automatic Readability Assessment and Text Simplification of German
Description
Python scripts that download the data from the web, process it and produce an HTML or a TETML file as well as TCF and plain text files as described in the paper "Corpus for Automatic Readability Assessment and Text Simplification of German".
Files
Additional details
References
- Battisti et al. (2020). A Corpus for Automatic Readability Assessment and Text Simplification of German. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 3302–3311 Marseille, 11–16 May 2020, ELRA.