DCASE 2023 Challenge Task 7
Creators
- 1. Gaudio Lab, inc.
- 2. Gaudio Lab, inc. / KAIST
- 3. Carnegie Mellon University
- 4. New York University
- 5. Doshisha University
- 6. Ritsumeikan University
- 7. CNRS, Ecole Centrale Nantes, Nantes Université
- 8. The University of Tokyo
Description
These audio files comprise the dataset and the sound samples generated by the systems submitted to DCASE 2023 Challenge Task 7, "Foley Sound Synthesis". The audio data consists of sound excerpts divided into 7 classes. Each excerpt is a 4-second mono clip with 16-bit samples at a sampling rate of 22,050 Hz. The classes are as follows:
- DogBark
- Footstep
- GunShot
- Keyboard
- MovingMotorVehicle
- Rain
- Sneeze/Cough
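The format stated above (mono, 16-bit, 22,050 Hz, 4 seconds) can be checked per file. The following is a minimal sketch using Python's standard `wave` module; the function names and the duration tolerance are illustrative assumptions, and it assumes the files are plain PCM WAV that `wave` can open.

```python
import wave

# Format described for every excerpt in this dataset.
EXPECTED_CHANNELS = 1        # mono
EXPECTED_SAMPLE_WIDTH = 2    # bytes per sample, i.e. 16-bit
EXPECTED_RATE_HZ = 22050
EXPECTED_SECONDS = 4.0


def describe_wav(path):
    """Return the basic format properties of a PCM WAV file."""
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": rate,
            "duration_s": wav.getnframes() / rate,
        }


def matches_described_format(path, tolerance_s=0.01):
    """True if the file is mono, 16-bit, 22,050 Hz and about 4 seconds long.

    tolerance_s is an assumed allowance for small length differences.
    """
    info = describe_wav(path)
    return (
        info["channels"] == EXPECTED_CHANNELS
        and info["sample_width_bytes"] == EXPECTED_SAMPLE_WIDTH
        and info["sample_rate_hz"] == EXPECTED_RATE_HZ
        and abs(info["duration_s"] - EXPECTED_SECONDS) <= tolerance_s
    )
```

Calling `matches_described_format` on each downloaded `.wav` file is a quick way to spot files that were truncated or resampled during transfer.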
Each submission consists of 700 sound excerpts, 100 in each category. For more details, please check the challenge website.
A portion of the sounds in the dataset was kindly provided with the permission of the BBC, on the condition that the sounds are used for this DCASE challenge and for research purposes only. You can check the original source of each sound in 'DevMeta.csv' and 'EvalMeta.csv', located in 'DCASE_2023_Challenge_Task_7_Dataset'.
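To look up the source of each sound, the metadata files can be loaded with Python's standard `csv` module. The sketch below makes no assumption about the column names, which are not listed here; it assumes a comma-separated file with a header row in UTF-8, and `read_metadata` is an illustrative name.

```python
import csv


def read_metadata(csv_path):
    """Return (column_names, rows) from a metadata CSV with a header row.

    Each row is a dict keyed by whatever column names the file provides.
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        rows = list(reader)
        columns = list(reader.fieldnames or [])
    return columns, rows
```

For example, `columns, rows = read_metadata("DCASE_2023_Challenge_Task_7_Dataset/DevMeta.csv")` and then printing `columns` shows which fields are available before filtering by source.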
Directory structure
DCASE_2023_Challenge_Task_7_Dataset
- /dev
  - /dog_bark
    - /....wav
  - ....
  - /sneeze_cough
- /eval
  - /dog_bark
    - /....wav
  - ....
  - /sneeze_cough
DCASE_2023_Challenge_Task_7_Baseline
- /dog_bark
  - /....wav
- ....
- /sneeze_cough
DCASE_2023_Challenge_Task_7_Submission
- /Submissions
  - /A
    - /TASys02
      - /dog_bark
        - /00.wav
        - /....wav
        - /99.wav
      - ....
      - /sneeze_cough
    - ....
    - /TASys11
  - /B
    - /TBSys01
    - ....
    - /TBSys31
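Given the layout above, where each system folder contains one sub-folder per class, a short script can count the `.wav` files per class. This is a minimal sketch using `pathlib`; `count_wavs_per_class` is an illustrative name, and the class folders are discovered from disk rather than hard-coded.

```python
from pathlib import Path


def count_wavs_per_class(system_dir):
    """Map each class sub-folder name to the number of .wav files it holds."""
    system_dir = Path(system_dir)
    counts = {}
    for class_dir in sorted(p for p in system_dir.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1
            for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() == ".wav"
        )
    return counts
```

For example, `count_wavs_per_class("DCASE_2023_Challenge_Task_7_Submission/Submissions/A/TASys02")` returns a dictionary keyed by class folder name. The same function can be pointed at the baseline folder or at `dev` and `eval`, since they follow the same class sub-folder convention.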
Contact
- Keunwoo Choi, keunwoo@gaudiolab.com
- Jaekwon Im, jakeoneijk@kaist.ac.kr