Cadenza Challenge ICASSP 2024 (ICASSP24): Submission audio samples for the ICASSP 2024 Cadenza Grand Challenge - Baseline systems

Cox, Trevor; Roa Dabike, Gerardo

doi:10.5281/zenodo.13285031

Published August 9, 2024 | Version V1.0.0

Dataset Open

Cadenza Challenge ICASSP 2024 (ICASSP24): Submission audio samples for the ICASSP 2024 Cadenza Grand Challenge - Baseline systems

1. University of Salford
2. University of Sheffield

Contributors

Data curators:

1. University of Nottingham
2. University of Salford
3. University of Leeds
4. University of Sheffield

Cadenza

This is the baseline submission data for the ICASSP 2024 Cadenza Grand Challenge (ICASSP24).

The Cadenza Challenges are improving music production and processing for people with a hearing loss. According to The World Health Organization, 430 million people worldwide have a disabling hearing loss. Studies show that not being able to understand lyrics is an important problem to tackle for those with hearing loss. Consequently, this task is about improving the intelligibility of lyrics when listening to pop/rock over headphones. But this needs to be done without losing too much audio quality - you can't improve intelligibility just by turning off the rest of the band! We will be using one metric for intelligibility and another metric for audio quality, and giving you different targets to explore the balance between these metrics.

Please see the Cadenza website for a full description of the data

Technical info

This dataset contains the baselines submission audio signals for the ICASSP24 challenge. The signals correspond to 10-second consecutive segments of the MUSDB18-HQ test split. The signals were processed according the ICASSP24 requirements. Please refer to the Cadenza challenge website and to the paper for details.

Description of files:

submission_T001.zip: package containing the audio signals of Baseline 1
submission_T002.zip: package containing the audio signals of Baseline 2
gains.json: Json file with all posible gain combinations.
head_loudspeaker_positions.json: Json file with the different combination of head rotations
listeners.test.json: Json file with the listeners audiograms
scenes.test.json: Json file with the scenes descriptions
scene_listeners.test.json: Json file with the list of listeners to process per scene
musdb18.test.json: Json file with the description of the MUSDB18-HQ test split
HAAQI_scores.zip: ZIP file containing one CSV per Team with HAAQI scores

The audio signals are organised as:

enhanced_signals/scene_<Scene_ID>_<Listener_ID>_remix.flac

where:

Scene_ID: is the unique id to identify each scene.
Listener_ID: 53 unique ids to identify each listener.

Other

Cite as:

G. Roa-Dabike, M. A. Akeroyd, S. Bannister, J. P. Barker, T. J. Cox, B. Fazenda, J. Firth, S. Graetzer, A. Greasley, R. R. Vos and W. M. Whitmer, "The First Cadenza Challenges: Using Machine Learning Competitions to Improve Music for Listeners With a Hearing Loss," in IEEE Open Journal of Signal Processing.

Notes

This repository only includes the submission samples for both baselines. If you need access to the submission samples for all submissions, please contact us at cadenzachallengecontact@gmail.com.

All submissions description:

19 packages (including baselines)
A total size of 384 GB (including baselines)
19,200 signals per team

Files

gains.json

Files (39.3 GB)

Name	Size	Download all
gains.json md5:9ab13633eb9584cc3dcd8baae9826f79	118.6 kB	Preview Download
HAAQI_scores.zip md5:e1dfcc5a686e1f44ce4b20719e738492	40.6 MB	Preview Download
head_loudspeaker_positions.json md5:3944cbfd2853563b8198c58614088e06	19.5 kB	Preview Download
listeners.test.json md5:8bd264e7fcf0c44028ad05ab28d8e11c	20.6 kB	Preview Download
musdb18.test.json md5:40e05f6b2e5fcb755c21309215d5c005	9.9 kB	Preview Download
scene_listeners.test.json md5:94b5bcdc2b37e0dab4b1ee7672e54e19	352.3 kB	Preview Download
scenes.test.json md5:e9292223fb8a2fca5f069154a22f6cf7	193.3 kB	Preview Download
submission_T001.zip md5:311a7ec50711ba2f13a74fddcde28a08	19.6 GB	Preview Download
submission_T002.zip md5:ee7f4a504c50af016bc8c60d6f3e158b	19.7 GB	Preview Download

Additional details

UK Research and Innovation
EnhanceMusic: Machine Learning Challenges to Revolutionise Music Listening for People with Hearing Loss EP/W019434/1

Repository URL: https://github.com/claritychallenge/clarity
Programming language: Python

	All versions	This version
Views	179	179
Downloads	354	354
Data volume	1.8 TB	1.8 TB

Contributors

Data curators:

Cadenza

gains.json

Files (39.3 GB)

Funding

Software

Cadenza Challenge ICASSP 2024 (ICASSP24): Submission audio samples for the ICASSP 2024 Cadenza Grand Challenge - Baseline systems

Authors/Creators

Contributors

Data curators:

Description

Cadenza

Technical info

Other

Cite as:

Notes

Files

gains.json

Files (39.3 GB)

Additional details

Funding

Software