Cadenza Challenge ICASSP 2024 (ICASSP24): Submission audio samples for the ICASSP 2024 Cadenza Grand Challenge - Baseline systems
Authors/Creators
Contributors
Description
Cadenza
This is the baseline submission data for the ICASSP 2024 Cadenza Grand Challenge (ICASSP24).
The Cadenza Challenges are improving music production and processing for people with a hearing loss. According to The World Health Organization, 430 million people worldwide have a disabling hearing loss. Studies show that not being able to understand lyrics is an important problem to tackle for those with hearing loss. Consequently, this task is about improving the intelligibility of lyrics when listening to pop/rock over headphones. But this needs to be done without losing too much audio quality - you can't improve intelligibility just by turning off the rest of the band! We will be using one metric for intelligibility and another metric for audio quality, and giving you different targets to explore the balance between these metrics.
Please see the Cadenza website for a full description of the data
Technical info
This dataset contains the baselines submission audio signals for the ICASSP24 challenge. The signals correspond to 10-second consecutive segments of the MUSDB18-HQ test split. The signals were processed according the ICASSP24 requirements. Please refer to the Cadenza challenge website and to the paper for details.
Description of files:
- submission_T001.zip: package containing the audio signals of Baseline 1
- submission_T002.zip: package containing the audio signals of Baseline 2
- gains.json: Json file with all posible gain combinations.
- head_loudspeaker_positions.json: Json file with the different combination of head rotations
- listeners.test.json: Json file with the listeners audiograms
- scenes.test.json: Json file with the scenes descriptions
- scene_listeners.test.json: Json file with the list of listeners to process per scene
- musdb18.test.json: Json file with the description of the MUSDB18-HQ test split
- HAAQI_scores.zip: ZIP file containing one CSV per Team with HAAQI scores
The audio signals are organised as:
enhanced_signals/scene_<Scene_ID>_<Listener_ID>_remix.flac
where:
- Scene_ID: is the unique id to identify each scene.
- Listener_ID: 53 unique ids to identify each listener.
Other
Cite as:
G. Roa-Dabike, M. A. Akeroyd, S. Bannister, J. P. Barker, T. J. Cox, B. Fazenda, J. Firth, S. Graetzer, A. Greasley, R. R. Vos and W. M. Whitmer, "The First Cadenza Challenges: Using Machine Learning Competitions to Improve Music for Listeners With a Hearing Loss," in IEEE Open Journal of Signal Processing.
Notes
Files
gains.json
Files
(39.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:9ab13633eb9584cc3dcd8baae9826f79
|
118.6 kB | Preview Download |
|
md5:e1dfcc5a686e1f44ce4b20719e738492
|
40.6 MB | Preview Download |
|
md5:3944cbfd2853563b8198c58614088e06
|
19.5 kB | Preview Download |
|
md5:8bd264e7fcf0c44028ad05ab28d8e11c
|
20.6 kB | Preview Download |
|
md5:40e05f6b2e5fcb755c21309215d5c005
|
9.9 kB | Preview Download |
|
md5:94b5bcdc2b37e0dab4b1ee7672e54e19
|
352.3 kB | Preview Download |
|
md5:e9292223fb8a2fca5f069154a22f6cf7
|
193.3 kB | Preview Download |
|
md5:311a7ec50711ba2f13a74fddcde28a08
|
19.6 GB | Preview Download |
|
md5:ee7f4a504c50af016bc8c60d6f3e158b
|
19.7 GB | Preview Download |
Additional details
Funding
- UK Research and Innovation
- EnhanceMusic: Machine Learning Challenges to Revolutionise Music Listening for People with Hearing Loss EP/W019434/1
Software
- Repository URL
- https://github.com/claritychallenge/clarity
- Programming language
- Python