Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Refinement dataset SPIRA
Creators
- Matheus Gauy, Marcelo (1)
- Finger, Marcelo (1)
- Aluisio, Sandra Maria (1)
- Spazzapan, Evelyn Alves (2)
- Berti, Larissa Cristina (2)
- Camargo Neto, Augusto César de (1)
- Candido Junior, Arnaldo (2)
- Casanova, Edresson (1)
- Svartman, Flaviane Romani Fernandes (1)
- Ferreira, Renato Cordeiro (1)
- Fernandes Jr, Ricardo (3)
- Goldman, Alfredo (1)
- Gris, Lucas R (3)
- Leyton, Pedro
- Levin, Anna Sara Shafferman (1)
- Martins, Marcus Vinicíus Moreira (1)
- Queiroz, Marcelo Gomes de (1)
- Quirino, J Henrique
- Medeiros, Beatriz Raposo de (1)
- Sabino, Ester Cerdeira (1)
- Silva, Daniel da (3)
- 1. Universidade de São Paulo - USP
- 2. Universidade Estadual Paulista - UNESP
- 3. Universidade Tecnológica Federal do Paraná - UTFPR
Description
This dataset was collected over the internet and in hospital wards to support the detection of respiratory insufficiency (typically caused by COVID-19). The data collection is part of the SPIRA Project, which aims to develop a system that recognizes respiratory insufficiency through speech analysis. The datasets presented here were used in the paper "Acoustic models of Brazilian Portuguese Speech based on Neural Transformers" by Marcelo Gauy and Marcelo Finger.
The spira_trimmed_data file contains the original ~1 hour dataset collected by the SPIRA Project over the internet (control) and in hospital wards (patients), as described in the paper "Deep learning against COVID-19: Respiratory insufficiency detection in Brazilian Portuguese Speech". We include it here for completeness.
The spira_control_full_mp3 file contains the complete ~18 hours of control data collected over the internet by the SPIRA Project. Although it is not useful for respiratory insufficiency detection, this dataset may be used for identifying age and gender, as we mention in our paper "Acoustic models of Brazilian Portuguese Speech based on Neural Transformers".
Files
(1.6 GB)
MD5 | Size |
---|---|
a0100393ed53b8c28949b017afa5ef15 | 1.2 GB |
7f4cbde0f75618b58f92ee10e1fbd086 | 395.5 MB |
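The MD5 checksums listed above can be used to check that a download completed without corruption. The following is a minimal sketch using only the Python standard library. The file names are not shown in the listing, so the checksums are keyed by the listed size, and the helper names `md5_of_file` and `matches_listed_checksum` are illustrative, not part of any SPIRA tooling.

```python
import hashlib

# MD5 checksums as listed on this page. The listing does not show file
# names, so each checksum is keyed by the size shown next to it.
LISTED_MD5 = {
    "1.2 GB": "a0100393ed53b8c28949b017afa5ef15",
    "395.5 MB": "7f4cbde0f75618b58f92ee10e1fbd086",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path):
    """Return True if the file's MD5 equals one of the listed checksums."""
    return md5_of_file(path) in LISTED_MD5.values()
```

For example, `matches_listed_checksum("path/to/downloaded_file")` (a placeholder path) should return `True` for an intact copy of either file. Reading in chunks keeps memory use low even for the larger archive.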
Additional details
Related works
- Is supplemented by
- Dataset: 10.5281/zenodo.6794924 (DOI)
References
- Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Gauy, Marcelo; Finger, Marcelo (2022)
- Deep learning against COVID-19: Respiratory insufficiency detection in Brazilian Portuguese Speech - Casanova, Edresson; Gris, Lucas; Camargo, Augusto; Silva, Daniel; Gazzola, Murilo; Sabino, Ester; Levin, Anna; Candido Jr, Arnaldo; Aluisio, Sandra; Finger, Marcelo (2021)
- Detecting Respiratory Insufficiency via Voice Analysis: the SPIRA Project - Aluisio, Sandra; Camargo, Augusto; Candido Jr, Arnaldo; Fernandes Jr, Ricardo; Casanova, Edresson; Gris, Lucas; Svartman, Flaviane; Silva, Daniel; Ferreira, Renato; Finger, Marcelo; Goldman, Alfredo; Spazzapan, Evelyn; Leyton, Pedro; Berti, Larissa; Levin, Anna; Gauy, Marcelo; Martins, Marcus; Quirino, Henrique; Queiroz, Marcelo; Raposo, Beatriz; Sabino, Ester (2022)
- Detecting respiratory insufficiency by voice analysis: the SPIRA project - Finger, Marcelo; Aluisio, Sandra; Spazzapan, Evelyn; Berti, Larissa; Camargo, Augusto; Candido Jr, Arnaldo; Casanova, Edresson; Svartman, Flaviane; Ferreira, Renato; Fernandes Jr, Ricardo; Goldman, Alfredo; Gris, Lucas; Leyton, Pedro; Levin, Anna; Martins, Marcus; Queiroz, Marcelo; Quirino, Henrique; Medeiros, Beatriz; Sabino, Ester; Silva, Daniel (2021)