Nonspeech7k dataset
- 1. South China University of Technology
Contributors
Related person:
Researcher:
Supervisor:
- 1. South China University of Technology
Description
The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided into train and test sets. The train set consists of 6,289, and the test set consists of 725 files. The files were strongly manually annotated with a single ground-truth label. The length of each file is from 500 milliseconds to 4 seconds.
The dataset is only allowed for non-commercial and academic research purposes under the creative commons (CC BY-NC-SA 4.0) license. If you use the dataset, please cite our paper and acknowledge the source(freesound.org, Youtube, and Aigei). More details about the Nonspeech7k dataset are available in our article.
Article title: "Nonspeech7k dataset: Classification and analysis of human nonspeech sound"
Files
metadata of test set.csv
Files
(2.5 GB)
Name | Size | Download all |
---|---|---|
md5:872465ca83b24dc80f1483ff50dbeef3
|
51.8 kB | Preview Download |
md5:cf8f7a57d49c43dd4e5c05c3707291d3
|
478.5 kB | Preview Download |
md5:791585589019d07e01ef427675f75eca
|
221.8 MB | Preview Download |
md5:6841287789e0cddb60174eb7bbde8d64
|
2.3 GB | Preview Download |
md5:16fe64c4bcc270cb211762d22602eaaf
|
1.4 kB | Preview Download |
Additional details
Related works
- Is cited by
- Journal article: 10.1049/sil2.12233 (DOI)