Emozionalmente: a crowdsourced Italian speech emotional corpus

Catania, Fabio

doi:10.5281/zenodo.6569824

Published May 25, 2022 | Version 1.0

Dataset Open

Catania, Fabio¹

1. Politecnico di Milano

This repository contains Emozionalmente: an extensive Italian speech emotional corpus. The dataset consists of 6902 labeled samples acted out by 431 amateur actors while verbalizing 18 different sentences expressing the Big Six emotions (anger, disgust, fear, joy, sadness, surprise) plus neutrality. Labels represent the emotional communicative intention of the actors (i.e., the seven emotional states).

Recordings were generally obtained with non-professional equipment. They are mono .wav files with a bit depth of 16 bits and a sample rate of 16000 Hz. The mean recording duration is 3.81 seconds (SD = 0.99 seconds).
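To check that a downloaded clip matches the format stated above, a minimal sketch using Python's standard `wave` module could look like the following. The function names `describe_wav` and `matches_expected_format` are illustrative and not part of the dataset's code.

```python
import wave

# Expected format, taken from the dataset description.
EXPECTED_CHANNELS = 1          # mono
EXPECTED_SAMPLE_WIDTH = 2      # 16 bits = 2 bytes per sample
EXPECTED_SAMPLE_RATE = 16000   # Hz


def describe_wav(path):
    """Return (channels, sample_width_bytes, sample_rate_hz, duration_s)."""
    with wave.open(path, "rb") as wav_file:
        channels = wav_file.getnchannels()
        sample_width = wav_file.getsampwidth()
        sample_rate = wav_file.getframerate()
        duration = wav_file.getnframes() / sample_rate
    return channels, sample_width, sample_rate, duration


def matches_expected_format(path):
    """True if the file is mono, 16-bit, 16000 Hz."""
    channels, sample_width, sample_rate, _ = describe_wav(path)
    return (channels, sample_width, sample_rate) == (
        EXPECTED_CHANNELS,
        EXPECTED_SAMPLE_WIDTH,
        EXPECTED_SAMPLE_RATE,
    )
```

Calling `describe_wav` on every clip and averaging the returned durations is one way to compare a local copy against the reported mean of 3.81 seconds.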

We validated the emotional content of the clips by asking 829 human evaluators to guess the emotion conveyed in each recording (5 evaluations per audio). The evaluators reached an overall accuracy of 66%, which is comparable with previous literature in the field.
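As a rough illustration of how such a recognition accuracy can be computed, the sketch below compares the emotion each actor intended with the emotion an evaluator recognized. It assumes every clip received exactly 5 evaluations and that evaluations are available as (intended, recognized) pairs; the function names are hypothetical, and this is not the authors' validation code.

```python
SAMPLES = 6902               # labeled clips, from the description
EVALUATIONS_PER_SAMPLE = 5   # evaluations per audio, from the description


def expected_evaluation_count(samples=SAMPLES, per_sample=EVALUATIONS_PER_SAMPLE):
    """Total evaluations if every clip received exactly `per_sample` ratings."""
    return samples * per_sample


def recognition_accuracy(pairs):
    """Fraction of (intended, recognized) pairs where the two labels agree."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("at least one evaluation is required")
    correct = sum(1 for intended, recognized in pairs if intended == recognized)
    return correct / len(pairs)
```

Under the stated assumption, `expected_evaluation_count()` gives 6902 × 5 = 34510 evaluations in total.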

Besides the audio data, the repository contains three .csv files describing the demographics of the actors and the evaluators, as well as the emotions they expressed and recognized for each audio.

Soon we will publish a paper with more information about the dataset, including data acquisition methodology, participant demographics, and data validation. Also, to help your research in Speech Emotion Recognition, we share the code for training and testing an emotion classifier on the Emozionalmente dataset, which reaches 81% accuracy.

If you use this dataset, please cite "Emozionalmente: a crowd-sourced Italian speech emotional corpus" by Fabio Catania.

Files (581.4 MB)

Name	MD5	Size
emozionalmente_dataset.zip	4b06145cc92885267c2f1df9c7e693f8	581.4 MB
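The MD5 checksum listed for the archive can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib`; the helper names and the assumption that the archive sits at a local path of your choosing are illustrative.

```python
import hashlib

# Checksum published alongside emozionalmente_dataset.zip.
EXPECTED_MD5 = "4b06145cc92885267c2f1df9c7e693f8"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def archive_is_intact(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Reading in chunks keeps memory use small even though the archive is several hundred megabytes.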

	All versions	This version
Views	3,349	2,418
Downloads	817	546
Data volume	534.6 GB	365.1 GB
