Published March 25, 2019 | Version 1.0
Video/Audio Open

Test Database for the Assessment of Immersive Audio Systems

  • 1. University of York

Description

This repository contains a new library of listening material, for the testing of immersive audio systems, that includes synthetic sound sources, speech recordings and short musical and instrumental performances.

Evaluation of perceived audio quality is an essential part of spatial audio system design, where listening tests help to reveal any
spatial and timbral distortions that occur. Selection of audio stimuli constitutes an important part of listening test methods, as different stimuli will reveal specific properties of the perceived audio. A wide range of listening test material is therefore required, from which the most appropriate stimuli can be chosen based on the context of the test. For researchers in the field of immersive audio, availability of such materials can be sparse due to the differing requirements of surround sound and ambisonic testing. To this end a new test database has been developed, for use in the spatial and timbral evaluation of immersive audio systems.

---

The data is organised as follows:

Source Files
- Source_2-Pop (1kHz tone, one frame long (25ms or 50ms))
- Source_3rdOctaveBandPinkNoise (10 & 60 second durations, frequency bands; 32, 64, 125, 250, 500, 1k, 2k, 4k, 8k, 16kHz)
- Source_500-2000Hz_PinkNoise (Pink noise with frequencies below 500Hz removed & cut-off at 2kHz)
- Source_AcousticGuitar&Vocals (4 original pieces consisting of multiple guitar, vocal, drum & shaker tracks)
- Source_ConversationalSpeech (selection of short conversations & passages recorded in an anecohic chamber and reverberant classroom)
- Source_DTMF_Tones (Tone pairs consisting of lower & higher frequencies with durations of 1s, 10s, 100ms & 200ms)
- Source_GreenwichTimeSignal (series of five 0.1 second, 1 kHz tone bursts separated by 0.9 seconds of silence concluded by a 0.5 second 1 kHz tone)
- Source_PinkNoise (durations of 1s, 10s, 60s, 100ms & 200ms)
- Source_SinePureTones (1s, 10s, 60s, 100ms & 200ms durations, frequencies; 20, 32, 64, 125, 250, 440, 500, 1k, 2k, 4k,  8k, 16k, 20kHz)
- Source_SpeechMaterial(Female) (includes sentences & passages; speaker positions & names; azimuth & elevation angles (-180 to +180); Numbers, alphabet & assorted audio terms)
- Source_SpeechMaterial(Male) (includes sentences & passages; speaker positions & names; azimuth & elevation angles (-180 to +180); Numbers, alphabet & assorted audio terms)
- Source_SpeechMaterial(Mandarin) (includes only sentences & passages)
- Source_WhiteNoise (durations of 1s, 10s, 60s, 100ms & 200ms)

Ambisonically Encoded Files
- XOrder_3rdOctPinkNoise (X, where X = 1st, 3rd, 5th, 7th, Order encoded 1/3 Octave Band Pink noise files) 
    - Cube (sources encoded to cube face positions)
        - 3rdOctPinkNoise_32Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 32Hz center frequency)
        - 3rdOctPinkNoise_64Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 64Hz center frequency)
        - 3rdOctPinkNoise_125Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 125Hz center frequency)
        - 3rdOctPinkNoise_250Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 250Hz center frequency)
        - 3rdOctPinkNoise_500Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 500Hz center frequency)
        - 3rdOctPinkNoise_1000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 1kHz center frequency)
        - 3rdOctPinkNoise_2000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 2kHz center frequency)
        - 3rdOctPinkNoise_4000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 4kHz center frequency)
        - 3rdOctPinkNoise_8000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 8kHz center frequency)
        - 3rdOctPinkNoise_16000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 16kHz center frequency)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - 3rdOctPinkNoise_32Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 32Hz center frequency)
        - 3rdOctPinkNoise_64Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 64Hz center frequency)
        - 3rdOctPinkNoise_125Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 125Hz center frequency)
        - 3rdOctPinkNoise_250Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 250Hz center frequency)
        - 3rdOctPinkNoise_500Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 500Hz center frequency)
        - 3rdOctPinkNoise_1000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 1kHz center frequency)
        - 3rdOctPinkNoise_2000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 2kHz center frequency)
        - 3rdOctPinkNoise_4000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 4kHz center frequency)
        - 3rdOctPinkNoise_8000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 8kHz center frequency)
        - 3rdOctPinkNoise_16000Hz_-20dBFS_10s_48kHz_24Bit (10 seconds, 16kHz center frequency)
- XOrder_500-2000Hz_PinkNoise (X Order encoded 500-2000Hz Pink noise files)
    - Cube (sources encoded to cube face positions)
        - 500-2000Hz_PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - 500-2000Hz_PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - 500-2000Hz_PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - 500-2000Hz_PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - 500-2000Hz_PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - 500-2000Hz_PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - 500-2000Hz_PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
- XOrder_BroadcastSources (X Order encoded 2-pip, GTS & DTMF tone files)
    - 2-Pop_-20dBFS_25ms_48kHz_24Bit (encoded to cube face positions)
    - 2-Pop_-20dBFS_50ms_48kHz_24Bit (encoded to cube face positions)
    - DTMF_Tones_-20dBFS_1s_48kHz_24Bit (1 second, encoded to front center position)
    - DTMF_Tones_-20dBFS_10s_48kHz_24Bit (10 seconds, encoded to front center position)
    - DTMF_Tones_-20dBFS_100ms_48kHz_24Bit (100 milliseconds, encoded to front center position)
    - DTMF_Tones_-20dBFS_200ms_48kHz_24Bit (200 milliseconds, encoded to front center position)
    - GTS_Full_-20dBFS_48kHz_24Bit (encoded to cube face positions)
- XOrder_ExampleTestFiles (X Order encoded Pink noise announced example test files e.g. "Front Center" *Noise burst at front center*)
    - 1Second (announced 1 second pink noise encoded to ITU-R BS.2159-4 and SMPTE 2603 speaker positions)
    - 100ms_3Bursts (announced 3 bursts of 100ms pink noise encoded to ITU-R BS.2159-4 and SMPTE 2603 speaker positions)
    - 200ms_3Bursts (announced 3 bursts of 200ms pink noise encoded to ITU-R BS.2159-4 and SMPTE 2603 speaker positions)
- XOrder_MovingSources (X Order encoded noise sources that circle azimuth/elevation at specified speeds)
    - PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds, azimuth & elevation pink noise at 45, 90 & 180 degrees per second)
    - WhiteNoise_-20dBFS_60s_48kHz_24Bit (60 seconds, azimuth & elevation white noise at 45, 90 & 180 degrees per second)
- XOrder_PinkNoise (X Order encoded Pink noise files)
    - Cube (sources encoded to cube face positions)
        - PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - PinkNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - PinkNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - PinkNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - PinkNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - PinkNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
- XOrder_WhiteNoise (X Order encoded White noise files)
    - Cube (sources encoded to cube face positions)
        - WhiteNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - WhiteNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - WhiteNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - WhiteNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - WhiteNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)
    - Dodecahedron (sources encoded to dodecahedron face positions)
        - WhiteNoise_-20dBFS_1s_48kHz_24Bit (1 second)
        - WhiteNoise_-20dBFS_10s_48kHz_24Bit (10 seconds)
        - WhiteNoise_-20dBFS_60s_48kHz_24Bit (60 seconds)
        - WhiteNoise_-20dBFS_100ms_48kHz_24Bit (100 milliseconds)
        - WhiteNoise_-20dBFS_200ms_48kHz_24Bit (200 milliseconds)

---

For any enquiries regarding the data please email: ho581@york.ac.uk

Data produced by Harry Ogden at the Audio Lab, Department of Electronics Engineering, University of York
Contact: ho581@york.ac.uk

Funding was provided by UK Engineering and Physical Sciences Research Council (EPSRC), the Department of Electronic Engineering at the University of York.

Files

1stOrder_3rdOctPinkNoise.zip

Files (69.8 GB)

Name Size Download all
md5:e8958d4064cdee93227a0f6a287f584a
738.0 MB Preview Download
md5:8b4b65525d5c12b466e64dfcf508b611
535.7 MB Preview Download
md5:56322feb71f15f2302fdfd48301d63ac
40.4 MB Preview Download
md5:049cb0a88f70cb872e0d2f96a92ac2b6
115.2 MB Preview Download
md5:ff165d6231fd269228291a561725d9d7
335.3 MB Preview Download
md5:fcc3acbf2612851bf4e1f2943688497a
534.6 MB Preview Download
md5:86fb5f7ef84adcd0a0e38dc4d4c93172
541.8 MB Preview Download
md5:3566b31006b32f8f6a071a60bc462945
2.6 GB Preview Download
md5:d13fc4bc664e1719137cfabb75fada7c
1.9 GB Preview Download
md5:2acc706543a574885d8777bace5752b4
179.5 MB Preview Download
md5:92736e5ab7e2b0f4f119ba9ae246cca6
410.3 MB Preview Download
md5:7e01bf1b601a99fd3e81604d29e92195
1.1 GB Preview Download
md5:3bd8e4d07d2629b5edce5c6ba34d84e0
1.3 GB Preview Download
md5:f432be2682df844b70286a29a94a8995
1.5 GB Preview Download
md5:dde667d0478b9356d9628ddd94436591
5.5 GB Preview Download
md5:bd00e676e1e4cd456d44f826e2a525b1
4.0 GB Preview Download
md5:df075da6c2f201f67bd238137312515d
391.0 MB Preview Download
md5:34648d640f75e8162507e622c7524c7b
868.1 MB Preview Download
md5:1a9422bfa91e8ede5c5676dc8ca44711
2.4 GB Preview Download
md5:53222fa38478456a98738a62e1c56fad
4.0 GB Preview Download
md5:a62cf6539595c05b4e2938c0e833eaaa
4.1 GB Preview Download
md5:045931db5e95bd79f98279a0187e813b
9.5 GB Preview Download
md5:b2bc68c65b62a6e45d5a77b477d905b2
6.8 GB Preview Download
md5:95bf472e66dd181dcb88f987ab61ee93
662.8 MB Preview Download
md5:88fb77b5a5549098ad03625122c5e513
1.5 GB Preview Download
md5:476836ddba7c7b142332417d72ed9d1f
3.4 GB Preview Download
md5:ff63b69c3050aadf2e485f0e75875671
6.8 GB Preview Download
md5:efda5321b32badc12950d28ed07b2feb
6.9 GB Preview Download
md5:0bc215d763572603e362cc47c41e92f1
2.0 kB Preview Download
md5:8614bd9a133d241d84d6de843622a6b4
97.6 MB Preview Download
md5:3fd88bc8bde58dd114007ccaf7a837ee
10.1 MB Preview Download
md5:d65a5c7e8014f599bd0a04b9a6069680
135.0 MB Preview Download
md5:b0fc1c44c31db954edf595e7c5da4b3c
350.0 MB Preview Download
md5:799243fee235d0d9006be2ee48f13fba
25.3 MB Preview Download
md5:ea4267790c4054211c02244995ffcea2
16.0 kB Preview Download
md5:1eec7bfc1c2b0878eede74c1e9ceb45d
10.0 MB Preview Download
md5:7a46eb018c92a815ae1908d4ee2ebe07
14.0 MB Preview Download
md5:7ce3db3ca8346f41500d74beff155f2e
294.1 MB Preview Download
md5:d4feb6c4f401a5286b4cb40082e887c9
277.2 MB Preview Download
md5:ffa55ad2995fc6c102a4cadc9cea6224
76.0 MB Preview Download
md5:cd75356bff33469756cd248419f1e287
10.1 MB Preview Download