Published August 13, 2022 | Version 1.0
Dataset Open

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

  • 1. Department of Industrial Design, KAIST
  • 2. Graduate School of Culture Technology, KAIST
  • 3. KIA Design Studio, Hyundai Motor Company

Description

Hi,KIA dataset is a shared short Wakeup Word database focusing on perceived emotion in speech The dataset contains 488 Wakeup Word speech. 

For more detailed information about the dataset, please refer to our paper: Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

File Description

  • wav/: wav files.
    • Filename f`{gender}_{pid}_{scene}_{trial}_{emotion}.wav` The first letter was used to express emotion.
       
  • annotation/: Information related to annotation and human validation of the entire speech
  • split: 8fold data split with {train, valid, test}.csv 

  • handcraft: Features used for data EDA and baseline performance

  • best_weights: wav2vec2.0 context network finetuning weights for re-implementation. Due to file size, we attach only fold M1, F5

 

Reference

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words [[ArXiv](https://arxiv.org/abs/2211.03371)]

```
@inproceedings{kim2022hi,
  title={Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words},
  author={Taesu Kim, SeungHeon Doh, Gyunpyo Lee, Hyung seok Jun, Juhan Nam, Hyeon-Jeong Suk},
  booktitle={Proceedings of the 14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)},
  year={2022}
}
```

Files

Files (750.0 MB)

Name Size Download all
md5:dec963a8200052ada3a436ad7c8ecdad
750.0 MB Download