Published June 13, 2023 | Version 1
Dataset Open

Nonspeech7k dataset

  • 1. South China University of Technology

Contributors

Related person:

Supervisor:

  • 1. South China University of Technology

Description

The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided into train and test sets. The train set consists of 6,289, and the test set consists of 725 files. The files were strongly manually annotated with a single ground-truth label. The length of each file is from 500 milliseconds to 4 seconds.

The dataset is only allowed for non-commercial and academic research purposes under the creative commons (CC BY-NC-SA 4.0) license. If you use the dataset, please cite our paper and acknowledge the source(freesound.org, Youtube, and Aigei). More details about the Nonspeech7k dataset are available in our article.

Article title: "Nonspeech7k dataset: Classification and analysis of human nonspeech sound"

Files

metadata of test set.csv

Files (2.5 GB)

Name Size Download all
md5:872465ca83b24dc80f1483ff50dbeef3
51.8 kB Preview Download
md5:cf8f7a57d49c43dd4e5c05c3707291d3
478.5 kB Preview Download
md5:791585589019d07e01ef427675f75eca
221.8 MB Preview Download
md5:6841287789e0cddb60174eb7bbde8d64
2.3 GB Preview Download
md5:16fe64c4bcc270cb211762d22602eaaf
1.4 kB Preview Download

Additional details

Related works

Is cited by
Journal article: 10.1049/sil2.12233 (DOI)