Pressel2016 — Imagined speech EEG (Spanish vowels and directional commands, re-hosted)

Pressel Coretto, Germán A.; Gareis, Iván E.; Rufiner, Hugo Leonardo

doi:10.5281/zenodo.19502780

Published April 11, 2026 | Version v1

Dataset Open

Pressel2016 — Imagined speech EEG (Spanish vowels and directional commands, re-hosted)

1. Universidad Nacional de Entre Ríos

Re-hosted EEG data from Pressel Coretto, Gareis, and Rufiner (2017), originally distributed through Google Drive (file id 0By7apHbIp8ENZVBLRFVlSFhzbHc) as an open-access companion to the SIPAIM/SPIE proceedings paper. Repackaged into per-subject ZIP archives for easier programmatic access from the MOABB benchmarking framework.

Paper

G. A. Pressel Coretto, I. E. Gareis, and H. L. Rufiner, “Open access database of EEG signals recorded during imagined speech,” in Proc. SPIE 10160, 12th International Symposium on Medical Information Processing and Analysis, 2017.
DOI: 10.1117/12.2255697

Paradigm

11-class imagined speech of Spanish phonemes and directional commands:

Vowels: /a/, /e/, /i/, /o/, /u/ (stim codes 1-5)
Directional commands: arriba (up), abajo (down), adelante (forward), atrás (back), derecha (right), izquierda (left) (stim codes 6-11)

Two modalities were recorded per stimulus: imagined speech (modality code 1) and pronounced speech (modality code 2). The MOABB adapter loads the imagined condition by default.

Each trial is 4 seconds at 1024 Hz (4096 samples).

Participants

15 Argentinian volunteers (7 female, 8 male, ages 24-28)
Institution: Universidad Nacional de Entre Ríos (Argentina)

Recording setup

Parameter	Value
Amplifier	Grass 8-18-36
ADC	DataTranslation DT9816
Channels	6 (F3, F4, C3, C4, P3, P4)
Sampling rate	1024 Hz
Bandpass	2–45 Hz
File format	MATLAB (.mat)

File structure

15 per-subject ZIP files (S01.zip–S15.zip). Each contains a single MATLAB file renamed from the original S{NN}_EEG.mat to sub-{NN}_eeg.mat:

S01.zip
  sub-01_eeg.mat
S02.zip
  sub-02_eeg.mat
...
S15.zip
  sub-15_eeg.mat
README.md   ← top-level, stimulus codes and channels

The .mat files are bit-identical to the originals; only the filename was changed to adopt the MOABB subject-ID convention.

Data format

Each .mat file contains an EEG variable of shape (n_trials, 6 * 4096 + 3). The first 24576 columns are flattened channel-first EEG samples; the last three columns are labels: [modality, stimulus_code, artifact_flag].

Loading with MOABB

from moabb.datasets import Pressel2016
from moabb.paradigms import MotorImagery

dataset = Pressel2016()
paradigm = MotorImagery(
    events=["vowel_a", "vowel_e", "vowel_i"],
)
X, y, metadata = paradigm.get_data(dataset=dataset, subjects=[1])

Re-hosting rationale

The original Google Drive share relies on the gdown library and a legacy resourcekey token. Both are fragile: Google Drive can quota-block downloads, change its confirm-token protocol, or revoke resource keys (we observed all three modes of failure). This Zenodo mirror provides direct, DOI-addressed URLs so the data can be fetched from any CI environment without credentials and without depending on consumer cloud storage.

The signal data is unchanged — only filename conventions and archive layout have been normalized.

License and attribution

The original SPIE publication describes this dataset as “open access” and “publicly available” but does not declare an explicit Creative Commons license. This mirror is published under Zenodo’s other-open category on the same terms as the authors’ original distribution: open for research use with attribution to the paper (10.1117/12.2255697). If you are one of the dataset authors and wish to clarify the license or request removal of this mirror, please contact germanpressel@gmail.com (the corresponding author listed in the paper) or file an issue at NeuroTechX/moabb.

Notes

Re-hosted copy of the imagined-speech EEG database distributed by Pressel Coretto, Gareis, and Rufiner as a companion to DOI 10.1117/12.2255697 (SPIE / SIPAIM 2016). The authoritative upstream source is the authors' Google Drive file 0By7apHbIp8ENZVBLRFVlSFhzbHc, which is also what their institutional page at fich.unl.edu.ar/sinc/downloads/imagined_speech/ redirects to. The EEG signal data is bit-identical to that source; only filenames were normalized to a sub-NN convention (sub-01_eeg.mat was S01_EEG.mat) and the Base de Datos Habla Imaginada/ prefix was dropped for cleaner extraction. The authors' original Protocolo.txt (Spanish) and Registros_sujetos.xlsx are included unchanged. The authors' paper calls the dataset 'open access' but does not grant an explicit Creative Commons license. This mirror is provided under Zenodo's 'other-open' category on the same implicit terms: open for research with attribution to the paper. Takedown on author request. Authors of this dataset may contact germanpressel@gmail.com (corresponding author) or file an issue at https://github.com/NeuroTechX/moabb to clarify the license or request removal.

Files

README.md

Files (1.8 GB)

Name	Size
README.md md5:2a6b72d141ef42282d9aee0e1c734e54	2.8 kB	Preview Download
S01.zip md5:4817bb4dbf702b8d8ad5988b39cec1b8	130.8 MB	Preview Download
S02.zip md5:6e75e5b0d8cbb01b83b95d298273c9ec	115.9 MB	Preview Download
S03.zip md5:375b400a45081548ff84c399a4582e39	122.0 MB	Preview Download
S04.zip md5:7f35046031a339439e77662f4bc21c5e	114.4 MB	Preview Download
S05.zip md5:7b138d25cedf50d25a31de03a61fa6c4	109.3 MB	Preview Download
S06.zip md5:3fed79d4463cffbc8eda9fca7dd18498	115.7 MB	Preview Download
S07.zip md5:1b2dc1184309476f8abce6413b0fd25f	111.2 MB	Preview Download
S08.zip md5:92dc4f7bfdfa2fe910dc4a43b9cffaae	120.5 MB	Preview Download
S09.zip md5:caa965dadd1fd70f77085004ac1b37b7	122.6 MB	Preview Download
S10.zip md5:7830f80d54f5f14b3d180479ad49be05	120.0 MB	Preview Download
S11.zip md5:56b0dd6bf3fc840b821030dc3607b199	119.5 MB	Preview Download
S12.zip md5:c4243d545650efd3456ab7fe2820025c	116.6 MB	Preview Download
S13.zip md5:720ed35c3789cfd901fe6c5b35671043	118.2 MB	Preview Download
S14.zip md5:a9f67c0b27aa7dcc557d6f9daa402ed3	121.5 MB	Preview Download
S15.zip md5:3ec785dd12b747aaa608af63560cfc99	113.2 MB	Preview Download

Additional details

Is derived from: https://drive.google.com/uc?export=download&id=0By7apHbIp8ENZVBLRFVlSFhzbHc (URL)
Is part of: https://github.com/NeuroTechX/moabb (URL)
Is supplement to: 10.1117/12.2255697 (DOI)

	All versions	This version
Views	95	95
Downloads	541	541
Data volume	60.2 GB	60.2 GB

Pressel2016 — Imagined speech EEG (Spanish vowels and directional commands, re-hosted)

Authors/Creators

Description

Paper

Paradigm

Participants

Recording setup

File structure

Data format

Loading with MOABB

Re-hosting rationale

License and attribution

Notes

Files

README.md

Files (1.8 GB)

Additional details

Related works