Data description
The following is a description of directories and files in the dataset.
Stress_dataset.zip: The zip file holds the data of 15 participants in different folders. Each folder contains raw data signals in CSV format in a sub-folder. A raw data folder consists of 6 different CSV files, including (1) EDA.csv (electrodermal activity), (2) HR.csv (heart rate), (3) TEMP.csv (skin temperature), (4) IBI.csv (inter-beat interval), (5) BVP.csv (blood volume pulse), and (6) ACC.csv (accelerometer data).
Each biometric signal data has the following information:
- Start time (epoch): The DateTime float number that contains the time that signal was generated using the internal clock of the wristband. The DateTime is stored at the first row of every data column.
- Frequency: The second cell of each column shows the data collection frequency
ACC.csv:
- Column I: x-Axis acceleration
- Column II: y-Axis acceleration
- Column III: z-Axis acceleration
BVP.csv:
- Column I: Blood volume pulse is a method of measuring the heart rate.
EDA.csv:
- Column I: Electrodermal activity of the skin, measuring the skin's electrical conductivity.
IBI.csv:
- Column I: time interval
- Column II: Inter-beat interval or beat-to-beat interval, being the time interval between individual beats of an individual's heart.
TEMP.csv:
- Column I: Skin temperature in Celsius.
tags.csv: contains the timestamp of the user tag. A tag event occurs when the user clicks the button on the watch to mark an event. However, the subjects did not consistently use this feature and the field has no information value in our study.
In some cases, the sensor data csv files are empty. This constitutes a failure of the device to capture data.
Survey Results File
Each folder name is identical to the participant's ID in both data and survey files. All of the signals were synchronized to bring them to a common frequency. The accelerometer data is not used in the stress detection model. Some of the basic physical activities can be estimated from the accelerometer sensor, which could be further used to potentially include the activity context in stress detection.
SurveyResults.xlsx: The Excel file holds all participant survey results and their annotated stress level in Excel sheets (a sheet for each participant). Sheet names are the participant's IDs. However, the IDs are generated in an ID column for all files for more convenience. The following are the excel sheet columns:
- Column A: ID - Anonymized Id of the user.
- Column B: Start time - Event start time.
- Column C: End time - Event start time.
- Column D: Duration - Duration of the event.
- Column E: Date - Date of data collection.
- Column F: Stress level - Reported stress level by the nurse.
Nurses' responses regarding the nature of the stress.
- Column G: COVID Related
- Column H: Treating a COVID patient
- Column I: Patient in Crisis
- Column J: Patient or patient's family
- Column K: Doctors or colleagues
- Column L: Administration, lab, pharmacy, radiology, or other ancillary services
- Column M: Increased Workload
- Column N: Technology related stress
- Column O: Lack of supplies
- Column P: Documentation
- Column Q: Safety (physical or physiological threats)
- Column R: Lack of supplies
- Column S: Work Environment - Physical or others: work processes or procedures
- Column T: Description
Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: 1650551
Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: CNS-1429526
Funding provided by: Louisiana Board of Regents
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100006952
Award Number: LEQSF (2019-20)-ENH-DE-22