Published April 11, 2026 | Version v1
Dataset Open

Pressel2016 — Imagined speech EEG (Spanish vowels and directional commands, re-hosted)

  • 1. Universidad Nacional de Entre Ríos

Description

Re-hosted EEG data from Pressel Coretto, Gareis, and Rufiner (2017), originally distributed through Google Drive (file id 0By7apHbIp8ENZVBLRFVlSFhzbHc) as an open-access companion to the SIPAIM/SPIE proceedings paper. Repackaged into per-subject ZIP archives for easier programmatic access from the MOABB benchmarking framework.

Paper

G. A. Pressel Coretto, I. E. Gareis, and H. L. Rufiner, “Open access database of EEG signals recorded during imagined speech,” in Proc. SPIE 10160, 12th International Symposium on Medical Information Processing and Analysis, 2017.
DOI: 10.1117/12.2255697

Paradigm

11-class imagined speech of Spanish phonemes and directional commands:

  • Vowels: /a/, /e/, /i/, /o/, /u/ (stim codes 1-5)
  • Directional commands: arriba (up), abajo (down), adelante (forward), atrás (back), derecha (right), izquierda (left) (stim codes 6-11)

Two modalities were recorded per stimulus: imagined speech (modality code 1) and pronounced speech (modality code 2). The MOABB adapter loads the imagined condition by default.

Each trial is 4 seconds at 1024 Hz (4096 samples).

Participants

  • 15 Argentinian volunteers (7 female, 8 male, ages 24-28)
  • Institution: Universidad Nacional de Entre Ríos (Argentina)

Recording setup

ParameterValue
AmplifierGrass 8-18-36
ADCDataTranslation DT9816
Channels6 (F3, F4, C3, C4, P3, P4)
Sampling rate1024 Hz
Bandpass2–45 Hz
File formatMATLAB (.mat)

File structure

15 per-subject ZIP files (S01.zipS15.zip). Each contains a single MATLAB file renamed from the original S{NN}_EEG.mat to sub-{NN}_eeg.mat:

S01.zip
  sub-01_eeg.mat
S02.zip
  sub-02_eeg.mat
...
S15.zip
  sub-15_eeg.mat
README.md   ← top-level, stimulus codes and channels

The .mat files are bit-identical to the originals; only the filename was changed to adopt the MOABB subject-ID convention.

Data format

Each .mat file contains an EEG variable of shape (n_trials, 6 * 4096 + 3). The first 24576 columns are flattened channel-first EEG samples; the last three columns are labels: [modality, stimulus_code, artifact_flag].

Loading with MOABB

from moabb.datasets import Pressel2016
from moabb.paradigms import MotorImagery

dataset = Pressel2016()
paradigm = MotorImagery(
    events=["vowel_a", "vowel_e", "vowel_i"],
)
X, y, metadata = paradigm.get_data(dataset=dataset, subjects=[1])

Re-hosting rationale

The original Google Drive share relies on the gdown library and a legacy resourcekey token. Both are fragile: Google Drive can quota-block downloads, change its confirm-token protocol, or revoke resource keys (we observed all three modes of failure). This Zenodo mirror provides direct, DOI-addressed URLs so the data can be fetched from any CI environment without credentials and without depending on consumer cloud storage.

The signal data is unchanged — only filename conventions and archive layout have been normalized.

License and attribution

The original SPIE publication describes this dataset as “open access” and “publicly available” but does not declare an explicit Creative Commons license. This mirror is published under Zenodo’s other-open category on the same terms as the authors’ original distribution: open for research use with attribution to the paper (10.1117/12.2255697). If you are one of the dataset authors and wish to clarify the license or request removal of this mirror, please contact germanpressel@gmail.com (the corresponding author listed in the paper) or file an issue at NeuroTechX/moabb.

Notes

Re-hosted copy of the imagined-speech EEG database distributed by Pressel Coretto, Gareis, and Rufiner as a companion to DOI 10.1117/12.2255697 (SPIE / SIPAIM 2016). The authoritative upstream source is the authors' Google Drive file 0By7apHbIp8ENZVBLRFVlSFhzbHc, which is also what their institutional page at fich.unl.edu.ar/sinc/downloads/imagined_speech/ redirects to. The EEG signal data is bit-identical to that source; only filenames were normalized to a sub-NN convention (sub-01_eeg.mat was S01_EEG.mat) and the Base de Datos Habla Imaginada/ prefix was dropped for cleaner extraction. The authors' original Protocolo.txt (Spanish) and Registros_sujetos.xlsx are included unchanged. The authors' paper calls the dataset 'open access' but does not grant an explicit Creative Commons license. This mirror is provided under Zenodo's 'other-open' category on the same implicit terms: open for research with attribution to the paper. Takedown on author request. Authors of this dataset may contact germanpressel@gmail.com (corresponding author) or file an issue at https://github.com/NeuroTechX/moabb to clarify the license or request removal.

Files

README.md

Files (1.8 GB)

Name Size Download all
md5:2a6b72d141ef42282d9aee0e1c734e54
2.8 kB Preview Download
md5:4817bb4dbf702b8d8ad5988b39cec1b8
130.8 MB Preview Download
md5:6e75e5b0d8cbb01b83b95d298273c9ec
115.9 MB Preview Download
md5:375b400a45081548ff84c399a4582e39
122.0 MB Preview Download
md5:7f35046031a339439e77662f4bc21c5e
114.4 MB Preview Download
md5:7b138d25cedf50d25a31de03a61fa6c4
109.3 MB Preview Download
md5:3fed79d4463cffbc8eda9fca7dd18498
115.7 MB Preview Download
md5:1b2dc1184309476f8abce6413b0fd25f
111.2 MB Preview Download
md5:92dc4f7bfdfa2fe910dc4a43b9cffaae
120.5 MB Preview Download
md5:caa965dadd1fd70f77085004ac1b37b7
122.6 MB Preview Download
md5:7830f80d54f5f14b3d180479ad49be05
120.0 MB Preview Download
md5:56b0dd6bf3fc840b821030dc3607b199
119.5 MB Preview Download
md5:c4243d545650efd3456ab7fe2820025c
116.6 MB Preview Download
md5:720ed35c3789cfd901fe6c5b35671043
118.2 MB Preview Download
md5:a9f67c0b27aa7dcc557d6f9daa402ed3
121.5 MB Preview Download
md5:3ec785dd12b747aaa608af63560cfc99
113.2 MB Preview Download

Additional details