AVID: Aalto Vocal Intensity Database

Manila Kodali; Paavo Alku; Sudarsana Reddy Kadiri

doi:10.5281/zenodo.10524873

Published January 17, 2024 | Version v4

Dataset Open

AVID: Aalto Vocal Intensity Database

1. Aalto University

Data description:

AVID includes speech and EGG produced by 50 speakers (25 males, 25 females) who varied their vocal intensity in four categories (soft, normal, loud, and very loud). Recordings were conducted using a constant mouth-to-microphone distance and by recording a calibration tone. The speech data was labeled sentence-wise using a total of 19 labels that support the utilisation of the data in ML-based studies of vocal intensity based on supervised learning. Further information can be found in the 'readme.docx' file from the upload.

when collected the data:

Data is collected in 2021

Citation:

P. Alku, M. Kodali, L. Laaksonen, S.R. Kadiri, AVID: A speech database for machine learning studies on vocal intensity, Speech Communication, Vol. 157, Article 103039, 2024. https://doi.org/10.1016/j.specom.2024.103039

Files

AVID.zip

Files (12.4 GB)

Name	Size	Download all
AVID.zip md5:f103fe9592dc78a58915e72b84c08f79	9.9 GB	Preview Download
Readme.docx md5:802db20897b9e9d9389377054ef2faa8	36.9 kB	Download
Repositoty 2,full band.zip md5:c6bb942c1991f2f5c0f692d5b843ac32	2.4 GB	Preview Download

Additional details

Research Council of Finland
Speech-based biomarking of heart failure 330139

Citations

Oops! Something went wrong while fetching results.

	All versions	This version
Views	854	394
Downloads	405	188
Data volume	2.6 TB	1.7 TB

AVID: Aalto Vocal Intensity Database

Creators

Description

Files

AVID.zip

Files (12.4 GB)

Additional details

Funding