Video/Audio Open Access

voiceHome corpus: A corpus dedicated to distant-microphone speech processing in domestic environments

Bertin Nancy; Camberlein Ewen; Lebarbenchon Romain; Peillon Stephane; Lamandé Eric; Sivasankaran Sunit; Bimbot Frédéric; Illina Irina; Tom Ariane; Fleury Sylvain; Jamet Eric


This corpus includes reverberated, noisy speech signals spoken by native French talkers in a lounge and recorded by an 8-microphone device at various angles and distances and in various noise conditions.

Room impulse responses and noise-only signals recorded in various real rooms and homes and baseline speaker localization and enhancement software are also provided.

This corpus stands apart from other corpora in the field by the number of rooms and homes considered and by the fact that it is publicly available at no cost.


Other materials:

Documentation (in french):

The corpus documentation is both available into the archive and hereafter by clicking on voiceHome_corpus_french_documentation_v1.2.pdf .

Terms of use

You may exploit the corpus for a non-commercial scientific purpose provided you mention it in any written work or software you derive from its use. Within a published article, paper or report, the corpus must appear in the bibliographical references.

Speaker records diffusion consent

All participants have given an informed and signed consent about public diffusion of recorded sentences.

New corpus version available : voiceHome-2 corpus

A new version of the corpus is available : voiceHome-2 corpus web page


Files (1.9 GB)
Name Size
1.9 GB Download
702.2 kB Download
All versions This version
Views 675674
Downloads 225225
Data volume 325.7 GB325.7 GB
Unique views 614613
Unique downloads 140140


Cite as