There is a newer version of this record available.

Dataset Open Access

HEAR NeurIPS 2021 Datasets (Holistic Evaluation of Audio Representations)

Joseph Turian; Jordie Shier; Humair Raj Khan

NOTES:

* On Zenodo, please make sure you download datasets version 2021.2, not earlier versions.

* The datasets have different open licenses. Please see LICENSE.txt for each individual dataset's license.

These are the evaluation tasks for the HEAR (Holistic Evaluation of Audio Representations) 2021 NeurIPS challenge.

The aim of this challenge is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios. The HEAR 2021 challenge invites you to create an audio embedding that is as holistic as the human ear, i.e., one that performs well across a variety of everyday domains: What approach best generalizes to a wide range of downstream audio tasks without fine-tuning?

HEAR 2021 evaluates audio representations using a benchmark suite across a variety of domains, including speech, environmental sound, and music.

For more information, see the HEAR 2021 website and upcoming PMLR journal article.

All datasets were normalized to a common human-readable format using hearpreprocess. Until 2022-04-01, datasets will be mirrored at data.neuralaudio.ai. This Zenodo mirror has all audio tasks, but only at the 48000 Hz sampling rate. For other sampling rates (16000, 22050, 32000, 44100), please download files (requester pays) from Google Storage gs://hear2021-archive/tasks/ or AWS s3://hear2021-archive/tasks/
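
As a rough illustration of how a file name from this mirror might map to the bucket locations above, here is a minimal Python sketch. It rests on an assumption not confirmed by this record: that the buckets use the same archive names as this mirror, differing only in the trailing sampling rate. The helper name archive_uri and the store keys "gcs" and "s3" are hypothetical and exist only for this example.

```python
import re

# Sampling rates named in the description above.
SAMPLE_RATES = (16000, 22050, 32000, 44100, 48000)

# Bucket prefixes quoted in the description above.
BUCKETS = {
    "gcs": "gs://hear2021-archive/tasks/",
    "s3": "s3://hear2021-archive/tasks/",
}


def archive_uri(zenodo_name, sample_rate, store="gcs"):
    """Build a bucket URI from a 48000 Hz archive name on this mirror.

    ASSUMPTION: bucket objects are named like the files listed here,
    with only the sample-rate suffix changed. Check the bucket listing
    before relying on the result.
    """
    if sample_rate not in SAMPLE_RATES:
        raise ValueError(f"unsupported sampling rate: {sample_rate}")
    new_name, count = re.subn(
        r"-48000\.tar\.gz$", f"-{sample_rate}.tar.gz", zenodo_name
    )
    if count != 1:
        raise ValueError(f"not a 48000 Hz archive name: {zenodo_name}")
    return BUCKETS[store] + new_name
```

For example, archive_uri("hear2021-esc50-v2.0.0-full-48000.tar.gz", 16000) would yield a gs:// URI ending in -16000.tar.gz. Both buckets are requester pays, so the download itself needs a billing-enabled account on the corresponding cloud provider.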

Files (100.8 GB)

Name  Size  Checksum
hear2021-beehive_states_fold0-v2-full-48000.tar.gz  20.8 GB  md5:f9e045f9b2ddf5643edc1143304c80aa
hear2021-beehive_states_fold1-v2-full-48000.tar.gz  20.8 GB  md5:8bced7ce8336bf23fd61c61f04aee109
hear2021-beijing_opera-v1.0-hear2021-full-48000.tar.gz  36.4 MB  md5:33421c23e26151fe93ef6b312f87c8ec
hear2021-dcase2016_task2-hear2021-full-48000.tar.gz  518.0 MB  md5:a6d4026b6526372a0e5b0c83743110bb
hear2021-esc50-v2.0.0-full-48000.tar.gz  713.9 MB  md5:6e9b3b01d309af7e660b9a8661f41ab6
hear2021-fsd50k-v1.0-full-48000.tar.gz  27.1 GB  md5:fc41f8cd7f874b240936ad4876431b8f
hear2021-gunshot_triangulation-v1.0-full-48000.tar.gz  8.1 MB  md5:44da3ab63bd0eb24fcbb93fe70495c12
hear2021-libricount-v1.0.0-hear2021-full-48000.tar.gz  2.5 GB  md5:d1fc5d4bba6036aacfb489b471ccb23f
hear2021-maestro-v3.0.0-5h-48000.tar.gz  1.7 GB  md5:e2245138705d64afe36ba3488e2f63dd
hear2021-mridangam_stroke-v1.5-full-48000.tar.gz  179.1 MB  md5:133a78bd4909c507e4c6400eabdb2545
hear2021-mridangam_tonic-v1.5-full-48000.tar.gz  179.1 MB  md5:c0076fab256a7822054f36891eabc25b
hear2021-nsynth_pitch-v2.2.3-50h-48000.tar.gz  12.1 GB  md5:5946b7d547772f2ee542e34d357b7b66
hear2021-nsynth_pitch-v2.2.3-5h-48000.tar.gz  1.2 GB  md5:ad3f7de0c52b5b42cd6edf26db74e738
hear2021-speech_commands-v0.0.2-5h-48000.tar.gz  1.4 GB  md5:af44c67aa66a88a1841e2180486c235f
hear2021-speech_commands-v0.0.2-full-48000.tar.gz  6.3 GB  md5:2a0225bef9bdc342300b19e5f69f657e
hear2021-tfds_crema_d-1.0.0-full-48000.tar.gz  1.5 GB  md5:8a80e180e44f01a5a43c962de92c1e12
hear2021-tfds_gtzan-1.0.0-full-48000.tar.gz  2.6 GB  md5:9a5efe50d59a6d0dd6c71a38ccb4fea5
hear2021-tfds_gtzan_music_speech-1.0.0-full-48000.tar.gz  317.6 MB  md5:afe46656bbedfcbb2591e7b8cc079cf9
hear2021-vox_lingua_top10-hear2021-full-48000.tar.gz  856.8 MB  md5:7fdbe75cbae48f30374c9186a3b841cf
LICENSE.txt  3.7 kB  md5:72afb63a76d1fc3e307653ef3827a36d
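
Because several of these archives are tens of gigabytes, it can be worth checking a finished download against the md5 value listed with it. The following is a minimal Python sketch using only the standard library; the function names md5_of_file and verify_md5 are hypothetical helpers for this example and are not part of any HEAR tooling.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives never need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Compare a file's MD5 with a listed value.

    `expected` may be given with or without the 'md5:' prefix used
    in the file listing.
    """
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

For instance, after downloading the smallest archive, verify_md5("hear2021-gunshot_triangulation-v1.0-full-48000.tar.gz", "md5:44da3ab63bd0eb24fcbb93fe70495c12") should return True if the file arrived intact. A mismatch usually points to a truncated or corrupted download.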
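Views: 315 (all versions), 101 (this version)
Downloads: 417 (all versions), 70 (this version)
Data volume: 2.3 TB (all versions), 353.9 GB (this version)
Unique views: 239 (all versions), 92 (this version)
Unique downloads: 136 (all versions), 41 (this version)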
