Dataset Restricted Access

LHA Sentence Alignments Extracted From the Austria Press Agency Corpus

Spring, Nicolas; Rios, Annette; Ebling, Sarah

Conference paper: Exploring German Multi-Level Text Simplification

This dataset contains sentence alignments extracted with LHA (Nikolov and Hahnloser, 2019) from the Austria Press Agency (Austria Presse Agentur, APA) corpus. The alignments are drawn from news items published between August 2018 and April 2021. They pair simplified versions at CEFR levels A2 and B1 with the original standard German text.


Nikola I. Nikolov and Richard Hahnloser. 2019. Large-scale hierarchical alignment for data-driven text rewriting. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 844–853, Varna, Bulgaria. INCOMA Ltd.

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision to grant or deny access rests solely with the record owner.

Please specify name, email address, affiliation and motivation. Thanks.

                  All versions   This version
Views             285            285
Downloads         14             14
Data volume       45.7 MB        45.7 MB
Unique views      215            215
Unique downloads  14             14

