Published June 22, 2022 | Version v1
Dataset Open

Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Refinement dataset SPIRA

Description

This dataset was collected over the internet and in hospital wards with the goal of detecting respiratory insufficiency (typically caused by COVID-19). The data collection is part of the SPIRA Project, which aims to develop a system that recognizes respiratory insufficiency through speech analysis. The datasets presented here were used in the paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers, by Marcelo Gauy and Marcelo Finger.

The spira_trimmed_data file contains the original ~1 hour dataset collected over the internet (control) and in hospital wards (patients) by the SPIRA Project. It is described in the paper: Deep learning against COVID-19: Respiratory insufficiency detection in Brazilian Portuguese Speech. We include it here for completeness.

The spira_control_full_mp3 file contains the complete ~18 hours of control data collected over the internet by the SPIRA Project. While not useful for respiratory insufficiency detection, this data may be used for identifying age and gender, as discussed in our paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers.

Files (1.6 GB)

Size        Checksum
1.2 GB      md5:a0100393ed53b8c28949b017afa5ef15
395.5 MB    md5:7f4cbde0f75618b58f92ee10e1fbd086
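
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch, not part of the original record: the function names and the example path are hypothetical, and because the record as shown here does not say which file name goes with which checksum, the check accepts a match against either listed digest.

```python
import hashlib

# MD5 digests copied from the file listing of this record.
# The sizes in the comments follow the order of the listing.
LISTED_MD5 = {
    "a0100393ed53b8c28949b017afa5ef15",  # listed next to 1.2 GB
    "7f4cbde0f75618b58f92ee10e1fbd086",  # listed next to 395.5 MB
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat even for multi-gigabyte archives.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path):
    """Return True if the file's MD5 equals one of the digests listed above."""
    return md5_of_file(path) in LISTED_MD5
```

A call such as `matches_listed_checksum("path/to/downloaded_file")` (placeholder path) returns `True` only when the file is byte-identical to one of the two published files.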

Additional details

Related works

Is supplemented by
Dataset: 10.5281/zenodo.6794924 (DOI)

References

  • Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Gauy, Marcelo; Finger, Marcelo 2022
  • Deep learning against COVID-19: Respiratory insufficiency detection in Brazilian Portuguese Speech - Casanova, Edresson; Gris, Lucas; Camargo, Augusto; Silva, Daniel; Gazzola, Murilo; Sabino, Ester; Levin, Anna; Candido Jr, Arnaldo; Aluisio, Sandra; Finger, Marcelo 2021
  • Detecting Respiratory Insufficiency via Voice Analysis: the SPIRA Project - Aluisio, Sandra; Camargo, Augusto; Candido Jr, Arnaldo; Fernandes Jr, Ricardo; Casanova, Edresson; Gris, Lucas; Svartman, Flaviane; Silva, Daniel; Ferreira, Renato; Finger, Marcelo; Goldman, Alfredo; Spazzapan, Evelyn; Leyton, Pedro; Berti, Larissa; Levin, Anna; Gauy, Marcelo; Martins, Marcus; Quirino, Henrique; Queiroz, Marcelo; Raposo, Beatriz; Sabino, Ester 2022
  • Detecting respiratory insufficiency by voice analysis: the SPIRA project - Finger, Marcelo; Aluisio, Sandra; Spazzapan, Evelyn; Berti, Larissa; Camargo, Augusto; Candido Jr, Arnaldo; Casanova, Edresson; Svartman, Flaviane; Ferreira, Renato; Fernandes Jr, Ricardo; Goldman, Alfredo; Gris, Lucas; Leyton, Pedro; Levin, Anna; Martins, Marcus; Queiroz, Marcelo; Quirino, Henrique; Medeiros, Beatriz; Sabino, Ester; Silva, Daniel 2021