Silent Speech EMG

Gaddy, David

doi:10.5281/zenodo.4064409

Published October 2, 2020 | Version 1.0

Dataset Open

Silent Speech EMG

Gaddy, David¹

1. UC Berkeley

Facial electromyography recordings during both silent and vocalized speech.

This data is described in the publication "Digital Voicing of Silent Speech" at EMNLP 2020 (https://arxiv.org/abs/2010.02960).

Code for processing this data can be found at https://github.com/dgaddy/silent_speech.

Notes

Each data sample has 5 data files: {i}_emg.npy - a saved numpy array of size (T, 8) with the raw EMG signals; {i}_audio.flac - the raw audio recording; {i}_audio_clean.flac - audio with background noise reduced; {i}_info.json - JSON with extra information, such as the text prompt that was read; {i}_button.npy - a numpy array containing device button state, which is generally unused. Note that some samples do not represent actual datapoints, but are used as reference EMG or audio signals. These samples are marked with "sentence_index: -1" in the associated info file.

Files

Files (3.9 GB)

Name	Size	Download all
emg_data.tar.gz md5:7f97d2182b896652999b1b2d0c69fd7b	3.9 GB	Download

Views

Downloads

Show more details

	All versions	This version
Views	7,731	7,689
Downloads	2,696	2,668
Data volume	26.4 TB	26.3 TB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: October 3, 2020
Modified: October 14, 2022

Silent Speech EMG

Authors/Creators

Description

Notes

Files

Files (3.9 GB)