Published May 23, 2026 | Version v3

VoiceStick: A French Spontaneous Speech Corpus for Voice-Guided Drone Teleoperation

Description

VoiceStick is the first French spontaneous speech corpus dedicated to voice-guided drone teleoperation. It was collected from 29 dyads of native French-speaking participants using an asymmetric guide-pilot paradigm in a mixed-reality setting. The corpus captures the natural dynamics of spontaneous vocal interaction in a real-time teleoperation task, including hesitations, reformulations, and prosodic variability.

The corpus comprises 4,219 utterances totaling 19,829 words, with a vocabulary of 669 unique words and 2,421 distinct spoken commands.

If you use this corpus, please cite:

Henry, A., Rossato, S., Graff, C., Gomez-Balderas, J.-E., & Huet, S. (2026). VoiceStick: A Spontaneous Speech Corpus for Drone Voice Guidance. *Proceedings of CORIA-TALN 2026*. Nantes, France.

Files

README.md

Files (360.6 MB)

Name Size
md5:1b2b4eadd95215db983428339595c270
360.2 MB Preview Download
md5:2766d5f76e86ea93ec4d097d5aa61c91
7.1 kB Preview Download
md5:2b233db85518344f1e2b35c78e8c9b11
431.7 kB Preview Download

Additional details

Funding

Agence Nationale de la Recherche
UGA - IDEX UGA ANR-15-IDEX-0002

Dates

Available
2026