SentiANNO: Annotating Sentiment in Austrian Historical Newspapers
- 1. Department for Digital Humanities, University of Graz, Austria
Contributors
Data managers:
- 1. Universität zu Köln
- 2. Universität Passau
- 3. Universität Bielefeld
- 4. Univeristät Bielefeld
- 5. Digital Humanities im deutschsprachigen Raum
Description
This contribution presents SentiANNO, a sentiment-annotated corpus from historical Austrian newspapers; and describes the annotation process and the training of annotators. The corpus, which covers journalistic texts in German from 1800 to 1938, addresses the lack of sentiment-annotated resources for historical newspapers. Such a resource can be used to fine-tune existing Machine Learning models for Sentiment Analysis.The corpus includes texts from ANNO and DIGITARIUM collections, categorized into four sentiment categories (positive, negative, neutral and mixed) by three non-expert annotators. Throughout the annotation process, Doccano proved to be the most effective annotation tool, with preliminary results showing over 70% inter-annotator agreement despite genre and language complexity. The corpus will be publicly available on Zenodo, supporting open access.
Files
KRUSIC_Lucija_SentiANNO__Annotating_Sentiment_in_Austrian_Hi.pdf
Files
(5.6 MB)
Name | Size | Download all |
---|---|---|
md5:a79d7c0a20dd8cf83da9b58377032bd8
|
5.6 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Conference paper: 10.5281/zenodo.14943161 (DOI)