There is a newer version of the record available.

Published May 18, 2023 | Version v1
Dataset Open

AVID: Aalto Vocal Intensity Database

  • 1. Aalto University

Description

Data description:

AVID includes speech and EGG produced by 50 speakers (25 males, 25 females) who varied their vocal intensity in four categories (soft, normal, loud, and very loud). Recordings were conducted using a constant mouth-to-microphone distance and by recording a calibration tone. The speech data was labeled sentence-wise using a total of 19 labels that support the utilisation of the data in ML-based studies of vocal intensity based on supervised learning. Further information can be found in the 'readme.docx' file from the upload.

when collected the data:

Data is collected in 2021

Citation:

P. Alku, M. Kodali, L. Laaksonen, S.R. Kadiri, AVID: A speech database for machine learning studies on vocal intensity, Speech Communication, 2024 (Accepted/In press).

 

Files

AVID.zip

Files (9.9 GB)

Name Size Download all
md5:f103fe9592dc78a58915e72b84c08f79
9.9 GB Preview Download
md5:6c73676f381e4c1b9fe111cec358ae8b
38.2 kB Download

Additional details

Funding

Research Council of Finland
Speech-based biomarking of heart failure 330139