Nonspeech7k dataset

Muhammad Mamunur Rashid; Guiqing Li *; Chengrui Du

doi:10.5281/zenodo.6967442

Published June 13, 2023 | Version 1

Dataset Open

Nonspeech7k dataset

1. South China University of Technology

Contributors

1. South China University of Technology

The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided into train and test sets. The train set consists of 6,289, and the test set consists of 725 files. The files were strongly manually annotated with a single ground-truth label. The length of each file is from 500 milliseconds to 4 seconds.

The dataset is only allowed for non-commercial and academic research purposes under the creative commons (CC BY-NC-SA 4.0) license. If you use the dataset, please cite our paper and acknowledge the source(freesound.org, Youtube, and Aigei). More details about the Nonspeech7k dataset are available in our article.

Article title: "Nonspeech7k dataset: Classification and analysis of human nonspeech sound"

Files

metadata of test set.csv

Files (2.5 GB)

Name	Size
metadata of test set.csv md5:872465ca83b24dc80f1483ff50dbeef3	51.8 kB	Preview Download
metadata of train set .csv md5:cf8f7a57d49c43dd4e5c05c3707291d3	478.5 kB	Preview Download
test.zip md5:791585589019d07e01ef427675f75eca	221.8 MB	Preview Download
train.zip md5:6841287789e0cddb60174eb7bbde8d64	2.3 GB	Preview Download
youtube ID vs link .TXT md5:16fe64c4bcc270cb211762d22602eaaf	1.4 kB	Preview Download

Additional details

Is cited by: Journal article: 10.1049/sil2.12233 (DOI)

	All versions	This version
Views	6,144	6,101
Downloads	8,961	8,914
Data volume	7.4 TB	7.3 TB

Nonspeech7k dataset

Authors/Creators

Contributors

Related person:

Researcher:

Supervisor:

Description

Files

metadata of test set.csv

Files (2.5 GB)

Additional details

Related works