Published May 24, 2017 | Version v2
Video/Audio Open

NTCD-TIMIT

Authors/Creators

  • 1. International Computer Science Institute

Description

NTCD-TIMIT: A New Database and Baseline for Noise-robust Audio-visual Speech Recognition

Although audio-visual speech is well known to improve the robustness properties of automatic speech recognition (ASR) systems against noise, the realm of audio-visual ASR (AV-ASR) has not gathered the research momentum it deserves. This is mainly due to the lake of audio-visual corpora and the need to combine two fields of knowledge: ASR and computer vision. This paper describes the NTCD-TIMIT database and baseline that can overcome these two barriers and attract more research interests to AV-ASR. The database has been created by adding six noise types at a range of signal-to-noise ratios to the speech material of the recently published TCD-TIMIT corpus. The database also includes visual features that have been extracted from the TCD-TIMIT video recordings using the visual front-end presented in this paper. NTCD-TIMIT contains Kaldi scripts for training and decoding audio-only, video-only, and audio-visual ASR models. The baseline experiments and results obtained using these scripts are detailed in this paper.

Files

README.txt

Files (45.4 GB)

Name Size Download all
md5:f3ff201d8caea2f2149c4e1ca9366bd9
929.7 MB Download
md5:6e4b9942bad25d7ba762f6c3566eb564
890.0 MB Download
md5:c69fca3240570fefcba3818b39e5043d
816.9 MB Download
md5:d2221e055476ee3c0db9a041a2aba479
785.6 MB Download
md5:d643dfe2b3ba2b7bc245412cf46c8b19
756.5 MB Download
md5:513235b95e0618cba965275e65f7d265
851.7 MB Download
md5:2a02e531ac671e045d6c0603300b46f1
936.0 MB Download
md5:20a4c5c4ec1009817bd4bf4e7e62d045
899.0 MB Download
md5:4a01ef3814c483f301469fc05863fc48
827.2 MB Download
md5:7f650323f61ad8d21aeeec78faa1e217
796.2 MB Download
md5:f4fd02fe71e47343f96df01ac4f77dcc
767.1 MB Download
md5:3e6ff4e2b1dccbf567bf6a44b1fc1888
861.8 MB Download
md5:148056684e7a272afd0a182adb7b6671
998.0 MB Download
md5:9beaa328ed866da26e1d8fc9d8743684
964.0 MB Download
md5:31a88e4ef7d7dede237d73f0a75c6612
874.0 MB Download
md5:ae6c5c29ad4a7e8506a18fc8cbb7a754
829.3 MB Download
md5:8bdc1fa92d4018a732653ba38de721a6
788.9 MB Download
md5:a7273621f91691dcd0163d90668acb76
920.5 MB Download
md5:7efe2c785461b4b7449b2cc67b5ef9ad
651.9 MB Download
md5:5972279595399236300b1d947ce3aed8
957.7 kB Download
md5:e0bf78ad520157b73b4a2f8556b53196
941.3 MB Download
md5:853e57eac086475f950d9c5bc972415e
908.3 MB Download
md5:117a1c3415a6bd34f947831d8203d26c
836.0 MB Download
md5:ab0114d30d257a46c5a29a40fb63702c
802.5 MB Download
md5:34e74a40f0c9c720a3c6e6e1754b252a
772.5 MB Download
md5:5a2a8abb7f4e4cc7f9ea184e6658c423
871.4 MB Download
md5:870b2e14b87e67d202117f275c293e49
2.2 kB Preview Download
md5:a5589031d0baf5d62f4b9342c8fe34d3
952.5 MB Download
md5:f221aebca8d8d241b8a88d276b5f522e
914.9 MB Download
md5:1756891889e2436746c52a53d27e0694
839.2 MB Download
md5:3a767d9efc10f826b2c3f18283812cda
805.9 MB Download
md5:1492dae97544630023d997d004e38cd9
776.3 MB Download
md5:ff1a7ab13f5b17f104921e49b6433268
876.0 MB Download
md5:4c7164482eb293cbd13e57724499c70e
416.7 MB Download
md5:d992cea29f11ea83a1902296c506b147
13.3 GB Download
md5:06f47b7d769c36ce3119482f9ac92c93
953.7 MB Download
md5:ed645325310074047a69581e2fd83a30
918.9 MB Download
md5:45ac727da015bf360bf8047c9341c4e5
855.7 MB Download
md5:0c08c202d2c29f0ff92614e3552361a5
828.4 MB Download
md5:0ae4014625734251c232c84260b7af43
803.0 MB Download
md5:cd82f61c3c5dba05971f504fe6c12e5d
885.7 MB Download