Video/Audio Open Access

Test Database for the Assessment of Immersive Audio Systems

Ogden, Harry; Stubbs, Jess; Kearney, Gavin

This repository contains a new library of listening material, for the testing of immersive audio systems, that includes synthetic sound sources, speech recordings and short musical and instrumental performances.

Evaluation of perceived audio quality is an essential part of spatial audio system design, where listening tests help to reveal any spatial and timbral distortions that occur. Selection of audio stimuli constitutes an important part of listening test methods, as different stimuli will reveal specific properties of the perceived audio. A wide range of listening test material is therefore required, from which the most appropriate stimuli can be chosen based on the context of the test. For researchers in the field of immersive audio, availability of such materials can be sparse due to the differing requirements of surround sound and ambisonic testing. To this end a new test database has been developed, for use in the spatial and timbral evaluation of immersive audio systems.

---

The data is organised as follows:

Source Files
- Source_2-Pop (1kHz tone, one frame long (25ms or 50ms))
- Source_3rdOctaveBandPinkNoise (10 & 60 second durations, frequency bands; 32, 64, 125, 250, 500, 1k, 2k, 4k, 8k, 16kHz)
- Source_500-2000Hz_PinkNoise (Pink noise with frequencies below 500Hz removed & cut-off at 2kHz)
- Source_AcousticGuitar&Vocals (4 original pieces consisting of multiple guitar, vocal, drum & shaker tracks)
- Source_ConversationalSpeech (selection of short conversations & passages recorded in an anechoic chamber and a reverberant classroom)
- Source_DTMF_Tones (Tone pairs consisting of lower & higher frequencies with durations of 1s, 10s, 100ms & 200ms)
- Source_GreenwichTimeSignal (series of five 0.1 second, 1 kHz tone bursts separated by 0.9 seconds of silence concluded by a 0.5 second 1 kHz tone)
- Source_PinkNoise (durations of 1s, 10s, 60s, 100ms & 200ms)
- Source_SinePureTones (1s, 10s, 60s, 100ms & 200ms durations, frequencies; 20, 32, 64, 125, 250, 440, 500, 1k, 2k, 4k,  8k, 16k, 20kHz)
- Source_SpeechMaterial(Female) (includes sentences & passages; speaker positions & names; azimuth & elevation angles (-180 to +180); Numbers, alphabet & assorted audio terms)
- Source_SpeechMaterial(Male) (includes sentences & passages; speaker positions & names; azimuth & elevation angles (-180 to +180); Numbers, alphabet & assorted audio terms)
- Source_SpeechMaterial(Mandarin) (includes only sentences & passages)
- Source_WhiteNoise (durations of 1s, 10s, 60s, 100ms & 200ms)
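
Several of the synthetic sources above are fully specified by their descriptions. As an illustration, the Greenwich Time Signal pattern (five 0.1 second bursts of 1 kHz, each followed by 0.9 seconds of silence, then a 0.5 second tone) could be sketched in Python as follows. This is not the script used to produce the database: the function name `greenwich_time_signal` is hypothetical, the 48 kHz sample rate and -20 dBFS level are borrowed from the encoded file names, the level is assumed to refer to peak amplitude, and silence is assumed to follow the fifth burst as well.

```python
import math


def greenwich_time_signal(sample_rate=48000, freq_hz=1000.0, level_dbfs=-20.0):
    """Return float samples in [-1, 1] following the burst pattern described above."""
    # Assumption: the dBFS level refers to peak amplitude, not RMS.
    amplitude = 10 ** (level_dbfs / 20)

    def tone(duration_s):
        n = round(duration_s * sample_rate)
        return [
            amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n)
        ]

    def silence(duration_s):
        return [0.0] * round(duration_s * sample_rate)

    samples = []
    for _ in range(5):
        samples += tone(0.1)
        # Assumption: 0.9 s of silence also follows the fifth short burst.
        samples += silence(0.9)
    samples += tone(0.5)  # concluding long tone
    return samples


signal = greenwich_time_signal()
print(len(signal) / 48000, "seconds")
```

Under these assumptions the sequence lasts 5.5 seconds in total. The samples are returned as plain floats, so they would still need to be quantised (for example to 24 bit) before being written to an audio file.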

Ambisonically Encoded Files
- XOrder_3rdOctPinkNoise (X Order encoded 1/3 Octave Band Pink noise files, where X = 1st, 3rd, 5th or 7th)
    - Cube (sources encoded to cube face positions)
        - 3rdOctPinkNoise_32Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 32Hz center frequency)
        - 3rdOctPinkNoise_64Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 64Hz center frequency)
        - 3rdOctPinkNoise_125Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 125Hz center frequency)
        - 3rdOctPinkNoise_250Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 250Hz center frequency)
        - 3rdOctPinkNoise_500Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 500Hz center frequency)
        - 3rdOctPinkNoise_1000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 1kHz center frequency)
        - 3rdOctPinkNoise_2000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 2kHz center frequency)
        - 3rdOctPinkNoise_4000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 4kHz center frequency)
        - 3rdOctPinkNoise_8000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 8kHz center frequency)
        - 3rdOctPinkNoise_16000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 16kHz center frequency)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - 3rdOctPinkNoise_32Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 32Hz center frequency)
        - 3rdOctPinkNoise_64Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 64Hz center frequency)
        - 3rdOctPinkNoise_125Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 125Hz center frequency)
        - 3rdOctPinkNoise_250Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 250Hz center frequency)
        - 3rdOctPinkNoise_500Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 500Hz center frequency)
        - 3rdOctPinkNoise_1000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 1kHz center frequency)
        - 3rdOctPinkNoise_2000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 2kHz center frequency)
        - 3rdOctPinkNoise_4000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 4kHz center frequency)
        - 3rdOctPinkNoise_8000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 8kHz center frequency)
        - 3rdOctPinkNoise_16000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 16kHz center frequency)
- XOrder_500-2000Hz_PinkNoise (X Order encoded 500-2000Hz Pink noise files)
    - Cube (sources encoded to cube face positions)
        - 500-2000Hz_PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - 500-2000Hz_PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - 500-2000Hz_PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - 500-2000Hz_PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - 500-2000Hz_PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - 500-2000Hz_PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
- XOrder_BroadcastSources (X Order encoded 2-pop, GTS & DTMF tone files)
    - 2-Pop_-20dBFS_25ms_48kHz_24Bit (encoded to cube face positions)
    - 2-Pop_-20dBFS_50ms_48kHz_24Bit (encoded to cube face positions)
    - DTMF_Tones_-20dBFS_1s_48kHz_24Bit (1 second, encoded to front center position)
    - DTMF_Tones_-20dBFS_10s_48kHz_24Bit (10 seconds, encoded to front center position)
    - DTMF_Tones_-20dBFS_100ms_48kHz_24Bit (100 milliseconds, encoded to front center position)
    - DTMF_Tones_-20dBFS_200ms_48kHz_24Bit (200 milliseconds, encoded to front center position)
    - GTS_Full_-20dBFS_48kHz_24Bit (encoded to cube face positions)
- XOrder_ExampleTestFiles (X Order encoded Pink noise announced example test files e.g. "Front Center" *Noise burst at front center*)
    - 1Second (announced 1 second pink noise encoded to ITU-R BS.2159-4 and SMPTE 2603 speaker positions)
    - 100ms_3Bursts (announced 3 bursts of 100ms pink noise encoded to ITU-R BS.2159-4 and SMPTE 2603 speaker positions)
    - 200ms_3Bursts (announced 3 bursts of 200ms pink noise encoded to ITU-R BS.2159-4 and SMPTE 2603 speaker positions)
- XOrder_MovingSources (X Order encoded noise sources that circle azimuth/elevation at specified speeds)
    - PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds, azimuth & elevation pink noise at 45, 90 & 180 degrees per second)
    - WhiteNoise_-20dBFS_60s_48kHz_24Bit (60 seconds, azimuth & elevation white noise at 45, 90 & 180 degrees per second)
- XOrder_PinkNoise (X Order encoded Pink noise files)
    - Cube (sources encoded to cube face positions)
        - PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
- XOrder_WhiteNoise (X Order encoded White noise files)
    - Cube (sources encoded to cube face positions)
        - WhiteNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - WhiteNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - WhiteNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - WhiteNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - WhiteNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - WhiteNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - WhiteNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - WhiteNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - WhiteNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - WhiteNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
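
The encoded file names listed above follow a regular pattern of underscore-separated fields (content, level, optional duration, sample rate, bit depth), and each archive prefix gives the Ambisonic order. A minimal Python sketch of how these names might be parsed is given below. The helper names `parse_stem`, `order_from_archive` and `channels_for_order` are hypothetical, and the pattern is inferred only from the names in this listing, so files that deviate from it will raise an error. The channel count uses the standard relation of (N + 1)^2 channels for full-sphere Ambisonics of order N, which is not stated in this record.

```python
import re

# Pattern inferred from names such as
# "3rdOctPinkNoise_32Hz_-20dBFS_10s_48kHz_24Bit" (duration is optional).
STEM_RE = re.compile(
    r"^(?P<content>.+?)_(?P<level>-?\d+)dBFS"
    r"(?:_(?P<dur>\d+)(?P<unit>ms|s))?"
    r"_(?P<rate>\d+)kHz_(?P<bits>\d+)Bit$"
)
ORDER_RE = re.compile(r"^(\d+)(?:st|nd|rd|th)Order_")


def parse_stem(stem):
    """Split a file name (without extension) into its descriptive fields."""
    m = STEM_RE.match(stem)
    if m is None:
        raise ValueError(f"name does not follow the assumed pattern: {stem!r}")
    duration_s = None
    if m["dur"] is not None:
        duration_s = int(m["dur"]) / (1000 if m["unit"] == "ms" else 1)
    return {
        "content": m["content"],
        "level_dbfs": int(m["level"]),
        "duration_s": duration_s,
        "sample_rate_hz": int(m["rate"]) * 1000,
        "bit_depth": int(m["bits"]),
    }


def order_from_archive(archive_name):
    """Read the Ambisonic order from an archive name such as '7thOrder_PinkNoise.zip'."""
    m = ORDER_RE.match(archive_name)
    if m is None:
        raise ValueError(f"no order prefix found in {archive_name!r}")
    return int(m.group(1))


def channels_for_order(order):
    """Number of channels for full-sphere Ambisonics of the given order."""
    return (order + 1) ** 2


print(parse_stem("500-2000Hz_PinkNoise_-20dBFS_100ms_48kHz_24Bit"))
print(channels_for_order(order_from_archive("7thOrder_PinkNoise.zip")))
```

Names without a duration field, such as GTS_Full_-20dBFS_48kHz_24Bit, are returned with `duration_s` set to `None`.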

---

For any enquiries regarding the data please email: ho581@york.ac.uk

Data produced by Harry Ogden at the Audio Lab, Department of Electronic Engineering, University of York

Funding was provided by the UK Engineering and Physical Sciences Research Council (EPSRC) and the Department of Electronic Engineering at the University of York.

Files (69.8 GB)

Name (size, MD5 checksum)
- 1stOrder_3rdOctPinkNoise.zip (738.0 MB, md5:e8958d4064cdee93227a0f6a287f584a)
- 1stOrder_500-2000Hz_PinkNoise.zip (535.7 MB, md5:8b4b65525d5c12b466e64dfcf508b611)
- 1stOrder_BroadcastSources.zip (40.4 MB, md5:56322feb71f15f2302fdfd48301d63ac)
- 1stOrder_ExampleTestFiles.zip (115.2 MB, md5:049cb0a88f70cb872e0d2f96a92ac2b6)
- 1stOrder_MovingSources.zip (335.3 MB, md5:ff165d6231fd269228291a561725d9d7)
- 1stOrder_PinkNoise.zip (534.6 MB, md5:fcc3acbf2612851bf4e1f2943688497a)
- 1stOrder_WhiteNoise.zip (541.8 MB, md5:86fb5f7ef84adcd0a0e38dc4d4c93172)
- 3rdOrder_3rdOctPinkNoise.zip (2.6 GB, md5:3566b31006b32f8f6a071a60bc462945)
- 3rdOrder_500-2000Hz_PinkNoise.zip (1.9 GB, md5:d13fc4bc664e1719137cfabb75fada7c)
- 3rdOrder_BroadcastSources.zip (179.5 MB, md5:2acc706543a574885d8777bace5752b4)
- 3rdOrder_ExampleTestFiles.zip (410.3 MB, md5:92736e5ab7e2b0f4f119ba9ae246cca6)
- 3rdOrder_MovingSources.zip (1.1 GB, md5:7e01bf1b601a99fd3e81604d29e92195)
- 3rdOrder_PinkNoise.zip (1.3 GB, md5:3bd8e4d07d2629b5edce5c6ba34d84e0)
- 3rdOrder_WhiteNoise.zip (1.5 GB, md5:f432be2682df844b70286a29a94a8995)
- 5thOrder_3rdOctPinkNoise.zip (5.5 GB, md5:dde667d0478b9356d9628ddd94436591)
- 5thOrder_500-2000Hz_PinkNoise.zip (4.0 GB, md5:bd00e676e1e4cd456d44f826e2a525b1)
- 5thOrder_BroadcastSources.zip (391.0 MB, md5:df075da6c2f201f67bd238137312515d)
- 5thOrder_ExampleTestFiles.zip (868.1 MB, md5:34648d640f75e8162507e622c7524c7b)
- 5thOrder_MovingSources.zip (2.4 GB, md5:1a9422bfa91e8ede5c5676dc8ca44711)
- 5thOrder_PinkNoise.zip (4.0 GB, md5:53222fa38478456a98738a62e1c56fad)
- 5thOrder_WhiteNoise.zip (4.1 GB, md5:a62cf6539595c05b4e2938c0e833eaaa)
- 7thOrder_3rdOctPinkNoise.zip (9.5 GB, md5:045931db5e95bd79f98279a0187e813b)
- 7thOrder_500-2000Hz_PinkNoise.zip (6.8 GB, md5:b2bc68c65b62a6e45d5a77b477d905b2)
- 7thOrder_BroadcastSources.zip (662.8 MB, md5:95bf472e66dd181dcb88f987ab61ee93)
- 7thOrder_ExampleTestFiles.zip (1.5 GB, md5:88fb77b5a5549098ad03625122c5e513)
- 7thOrder_MovingSources.zip (3.4 GB, md5:476836ddba7c7b142332417d72ed9d1f)
- 7thOrder_PinkNoise.zip (6.8 GB, md5:ff63b69c3050aadf2e485f0e75875671)
- 7thOrder_WhiteNoise.zip (6.9 GB, md5:efda5321b32badc12950d28ed07b2feb)
- Source_2-Pop.zip (2.0 kB, md5:0bc215d763572603e362cc47c41e92f1)
- Source_3rdOctaveBandPinkNoise.zip (97.6 MB, md5:8614bd9a133d241d84d6de843622a6b4)
- Source_500-2000Hz_PinkNoise.zip (10.1 MB, md5:3fd88bc8bde58dd114007ccaf7a837ee)
- Source_AcousticGuitar&Vocals.zip (135.0 MB, md5:d65a5c7e8014f599bd0a04b9a6069680)
- Source_ConversationalSpeech.zip (350.0 MB, md5:b0fc1c44c31db954edf595e7c5da4b3c)
- Source_DTMF_Tones.zip (25.3 MB, md5:799243fee235d0d9006be2ee48f13fba)
- Source_GreenwichTimeSignal.zip (16.0 kB, md5:ea4267790c4054211c02244995ffcea2)
- Source_PinkNoise.zip (10.0 MB, md5:1eec7bfc1c2b0878eede74c1e9ceb45d)
- Source_SinePureTones.zip (14.0 MB, md5:7a46eb018c92a815ae1908d4ee2ebe07)
- Source_SpeechMaterial(Female).zip (294.1 MB, md5:7ce3db3ca8346f41500d74beff155f2e)
- Source_SpeechMaterial(Male).zip (277.2 MB, md5:d4feb6c4f401a5286b4cb40082e887c9)
- Source_SpeechMaterial(Mandarin).zip (76.0 MB, md5:ffa55ad2995fc6c102a4cadc9cea6224)
- Source_WhiteNoise.zip (10.1 MB, md5:cd75356bff33469756cd248419f1e287)
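
The MD5 checksums listed above can be used to confirm that an archive downloaded without corruption, which is worth doing for the multi-gigabyte files. The Python sketch below streams a file through `hashlib.md5` in chunks, so large archives are not loaded into memory at once, and compares the result with a listed digest. The function names and the local file path in the usage comment are hypothetical.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a digest written as 'md5:<hex>' or plain '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[4:]
    return md5_of_file(path) == expected


# Usage, assuming the archive was saved to the current directory:
# matches_listing("Source_2-Pop.zip", "md5:0bc215d763572603e362cc47c41e92f1")
```

A result of `False` suggests an incomplete or corrupted download, in which case the archive should be fetched again before unzipping.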