Published September 3, 2025 | Version v1
Dataset Open

1st Clarity Prediction Challenge (CPC1) dataset for hearing aid speech intelligibility prediction

  • 1. ROR icon University of Sheffield
  • 1. University of Nottingham
  • 2. ROR icon University of Salford
  • 3. ROR icon Cardiff University
  • 4. University of Nottingham School of Medicine
  • 5. ROR icon Federico Santa María Technical University

Description

This dataset was created for the first Clarity Prediction Challenge (CPC1), which investigated the prediction of speech intelligibility for hearing-aid users in noisy acoustic environments. The challenge is now complete, but the dataset is made available for ongoing research. It can be used to develop and evaluate algorithms that estimate speech intelligibility across a range of acoustic and hearing-aid conditions.

The release includes audio signals, listener metadata, and supporting documentation. These materials enable reproducible evaluation of past challenge submissions as well as benchmarking of new approaches.

Further information about the challenge, including background, task definitions, and baseline systems, is available at: https://claritychallenge.org/docs/cpc1/cpc1_intro

Files

Files (20.7 GB)

Name Size Download all
md5:db62ba9c018dca184453fc60901519df
129.4 kB Download
md5:bfae71cbc6e4b7f289f778bf4ddd58c4
6.5 GB Download
md5:6d68b89b16d8da8fe54989df8144ec14
14.2 GB Download

Additional details

Related works

Is described by
Conference paper: 10.21437/interspeech.2022-10821 (DOI)

Software

Repository URL
https://github.com/claritychallenge/clarity
Programming language
Python
Development Status
Active