Published September 10, 2020 | Version 1.0
Dataset Open

AriEmozione 1.0

Description

The corpus AriEmozione 1.0 contains a selection of operas composed between 1655 and 1765, with each verse annotated (labeled) with an emotion.
AriEmozione  1.0 is a subset of the materials collected by project CORAGO.
We  consider only the lyrical text of the arias in each of the 678 operas included.  All texts are written in Italian of that period and articulated in verses.

The following classes have been used to identify emotions:

  • Amore (Love)
  • Gioia (Joy)
  • Ammirazione (Admiration)
  • Rabbia (Anger)
  • Tristezza (Sadness)
  • Paura (Fear)
  • Nessuna (Nothing)

 

The corpus is composed of training (1973 verses), development (250 verses) and testing (250 verses) partitions. Data partitioning made use of Kullback–Leibler divergence to create partitions with similar probability distribution.

Each partition tsv file contains the following columns:

  • Verse ID : unique aria and verse ID
  • Verse text: the text of the verse in the aria
  • Emotion: one of the six emotions described above, or Nessuna.
  • Confidence: expression of confidence by the annotators, with the possible values of "Totalmente sicura" (very confident), "Abbastanza sicura" (somewhat confident), "Forti dubbi" (very doubtful).

How to cite:

@inproceedings{fernicola2020ariemozione,
  title={AriEmozione: Identifying Emotions in Opera Verses},
  author={Fernicola, Francesco and Zhang, Shibingfeng and Garcea, Federico and Bonora, Paolo and Barr{\'o}n-Cede\~no, Alberto},
  booktitle={Italian Conference on Computational Linguistics},
  year={2020}
}

Files

Files (283.5 kB)

Name Size Download all
md5:d7601b90d465734c559f93561f4f3621
29.2 kB Download
md5:94e7d3e2a480747e1d5f79b698178d27
28.6 kB Download
md5:3bdcc14d5b3f332f0a7d4dce669eb6b0
225.7 kB Download

Additional details

References

  • Fernicola, Zhang, Federico Garcea, Bonora, and Barrón-Cedeño (2020). AriEmozione: Identifying Emotions in Opera Verses. In Proceedings of the Italian Conference on Computational Linguistics (CLIC-it 2020), Bologna, Italy, 2021