WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System
Creators
Contributors
Data collector:
Description
A new large language model (LLM)-powered dataset namely wild domestic environment sound event detection (WildDESED). It is crafted as an extension to the original DESED dataset to reflect diverse acoustic variability and complex noises in home settings. We leveraged LLMs to generate eight different domestic scenarios based on target sound categories of the DESED dataset. Then we enriched the scenarios with a carefully tailored mixture of noises selected from AudioSet and ensured no overlap with target sound. We consider widely popular convolutional neural recurrent network to study WildDESED dataset, which depicts its challenging nature. We then apply curriculum learning by gradually increasing noise complexity to enhance the model's generalization capabilities across various noise levels.
Files
noise_train-5db.zip
Files
(15.4 GB)
Name | Size | Download all |
---|---|---|
md5:3580f6296612c2c18f3cd028307fbab2
|
2.9 GB | Preview Download |
md5:b41a0ffe0229a46ea09336483c94fb2c
|
2.8 GB | Preview Download |
md5:f5311c492ecbd3ac5c87f127493aec88
|
2.7 GB | Preview Download |
md5:d86008147e7973fd3e57fc774a11b374
|
2.8 GB | Preview Download |
md5:c4b24e38b2a0f689451a7286083bc96f
|
2.8 GB | Preview Download |
md5:7e16b9a550f68405342c8d71fb3636d2
|
335.8 MB | Preview Download |
md5:9766b4d7e19d6917458397317bfd0e49
|
330.8 MB | Preview Download |
md5:ebe9e8598aaa4f8515cc5251e5c16a38
|
320.6 MB | Preview Download |
md5:01c87a4d8ff71a9029385fd08ad751ae
|
325.3 MB | Preview Download |
md5:e5c49cc776f99334113e643476c78391
|
2.6 MB | Preview Download |
Additional details
Identifiers
- arXiv
- arXiv:2407.03656
Dates
- Created
-
2024-10-10
Software
- Repository URL
- https://github.com/swagshaw/WildDESED
- Development Status
- Active