SENSE-42: A multimodal human-computer interaction dataset for neurocognitive user state evaluation
Authors/Creators
Contributors
Data collector (2):
Description
The dataset includes multimodal signals across the following channels:
Behavioural Data
Participant interaction logs collected via PsychoPy, including task performance metrics, reaction times, and mouse & keyboard status per frame.
Electroencephalography (EEG)
32-channel scalp EEG recordings collected with the BioSemi ActiveTwo system at 1024 Hz. EEG signals are stored in .bdf format. We further shared the cleaned EEG files in .set format compatible with EEGLab in this repository.
Electrocardiography (ECG)
3-lead ECG signals extracted from the external channels of the EEG system. Electrode placement follows a modified lead configuration (see diagram in the GitHub repository).
Respiration Data
Respiratory cycles captured using a thoracic breathing belt, collected with BioSemi ActiveTwo system, processed with RespInPeace and stored as .wav files at 32 Hz.
Webcam Recordings
High-resolution facial video recordings were obtained for facial expression and pupilometry analysis.
All participants agreed to share their data for authorised research use.
Abstract
SENSE-42 is a publicly available, multimodal dataset designed to support the study of user state monitoring during extended computer interaction sessions via neurocognitive, physiological and behavioural responses. Combining high-resolution neurophysiological recordings with behavioural and subjective data, this dataset enables research on the alternations of attention, mental/physical fatigue, cognitive workload, and related subjective indices at a very early stage.
The dataset was collected from 42 participants over a 2-hour continuous interaction session, during which participants engaged in a series of designed tasks on a desktop computer with a mouse and keyboard. The experimental tasks were conducted within a fully simulated desktop operating system environment, designed to closely mimic real-world computer usage scenarios. This setup mirrors how people typically use computers in daily life. The simulated experiment program also enables the comprehensive capture of the mouse and keyboard data, synchronised with a high refresh rate monitor at 144 Hz. Recordings were collected in a noise-insulated room, minimising external interruptions from the environment or the experimenter.
Other
We introduce a multimodal dataset named Simulated Environment for Neurocognitive State Evaluation (SENSE-42), designed to study early alertness state fluctuations during realistic computer interaction in a desktop operating system environment. To accommodate user preferences while maintaining a unified experimental design, we developed precise replicas of two major operating system interfaces (macOS and Windows) to facilitate the study of task-related behavioural responses. Both interface styles were presented to each participant during the session, allowing within-subject comparisons across familiar and unfamiliar environments. The dataset includes synchronised high-resolution neurocognitive responses in multiple modalities, including physiological, behavioural, and self-reported data, from 42 participants within a simulated desktop operating system. These responses were collected over continuous 2-hour sessions for each participant, providing rich measurements of spontaneous fluctuations in user states. Structured tasks were designed to emulate everyday computer use—file management, typing, browsing, and application switching—under tightly controlled conditions. Participants self-reported their alertness state at five-minute intervals using predefined scales, including the Karolinska Sleepiness Scale (KSS), NASA Task Load Index (NASA-TLX), and attentiveness ratings. Physiological data were collected to profile internal user states from EEG, ECG and respiration signals, and behavioural data were recorded for keyboard and mouse dynamics. Additionally, trait-level measures such as the Epworth Sleepiness Scale (ESS) and the Pittsburgh Sleep Quality Index (PSQI) were collected to contextualise inter-individual variability in daytime sleepiness and nighttime sleep patterns.
The SENSE-42 dataset is collected in a time-locked experimental setting to replicate real-world computer use scenarios. It provides a structured resource for examining tonic alertness dynamics during human-computer interaction and may be used to explore relationships between various multimodal indicators of the user state and their personal traits. The dataset includes components relevant to research in areas such as cognitive state monitoring, user behaviour analysis, and physiological computing.
Methods
A total of 42 participants were recruited in our study, consisting of 21 males and 21 females. The average age of participants is 26.8 +/- 4.5 years, ranging from 18 to 40. Participants represented diverse ethnic backgrounds, education levels, and occupations. To preserve ecological validity, they were instructed to maintain their usual intake of substances that could influence alertness, such as caffeine or medication. The study was approved by the Queen Mary Ethics of Research Committee (Reference Number: PSY2025-32), and the procedures were designed in accordance with the Data Protection Privacy Notice of Queen Mary University of London. Detailed explanations of the study and its procedures were provided to the participants, and informed consent for data publication with optional data-sharing preferences was obtained from all participants. Participants received £30 for taking part in the study.
Notes
Files
Additional details
Identifiers
Related works
- Is cited by
- Conference paper: 10.1145/3772363.3798603 (DOI)
- Is published in
- Preprint: 10.1101/2025.09.03.673947 (DOI)
Funding
- China Scholarship Council
- PhD Scholarship 202309210085
Software
- Repository URL
- https://github.com/Catherine9811/HCI-SENSE-42
- Programming language
- Python
- Development Status
- Active