EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation
- 1. Academia Sinica
- 2. KAIST
- 3. Georgia Institute of Technology
Description
The EMOPIA (pronounced ‘yee-mò-pi-uh’) dataset is a shared multi-modal (audio and MIDI) database focusing on perceived emotion in pop piano music, intended to facilitate research on various tasks related to music emotion. The dataset contains 1,087 music clips from 387 songs, with clip-level emotion labels annotated by four dedicated annotators.
For more detailed information about the dataset, please refer to our paper: EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation.
File Description
- midis/: MIDI clips transcribed using GiantMIDI.
- Filename `Q1_xxxxxxx_2.mid`: `Q1` means this clip belongs to Q1 on the V-A (valence-arousal) space; `xxxxxxx` is the song ID on YouTube; and `2` means this clip is the 2nd clip taken from the full song (a parsing sketch follows this list).
- metadata/: metadata retrieved from YouTube during crawling.
- songs_lists/: YouTube URLs of the songs.
- tagging_lists/: raw tagging results for each sample.
- label.csv: metadata recording the filename, clip timestamps, and annotator for each clip.
- metadata_by_song.csv: lists all clips grouped by song; can be used to create train/val/test splits that avoid the same song appearing in both train and test (see the split sketch after this list).
- scripts/prepare_split.ipynb: the script that creates the train/val/test splits and saves them to CSV files.
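The filename convention above encodes three fields that most downstream code needs to recover. Here is a minimal Python sketch of parsing it; the regex, the helper name `parse_clip_name`, and the example YouTube ID are illustrative assumptions, not code shipped with the dataset.

```python
import re
from pathlib import Path

# Assumed pattern for EMOPIA clip filenames, e.g. "Q1_0vLPYiPN7qY_2.mid":
# quadrant, YouTube song ID, clip index within the song.
CLIP_RE = re.compile(r"^(Q[1-4])_(.+)_(\d+)\.(?:mid|midi|mp3)$")

def parse_clip_name(path):
    """Return (quadrant, youtube_id, clip_index) for one clip file."""
    m = CLIP_RE.match(Path(path).name)
    if m is None:
        raise ValueError(f"unexpected filename: {path}")
    return m.group(1), m.group(2), int(m.group(3))

# Under Russell's valence-arousal model, Q1 = high valence / high arousal,
# Q2 = low V / high A, Q3 = low V / low A, Q4 = high V / low A.
print(parse_clip_name("midis/Q1_0vLPYiPN7qY_2.mid"))
# -> ('Q1', '0vLPYiPN7qY', 2)
```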
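And here is a minimal sketch of a song-level split built from metadata_by_song.csv, in the spirit of scripts/prepare_split.ipynb. The column name `songID` and the split ratios are assumptions; check the CSV headers and the notebook for the actual values.

```python
import random
import pandas as pd

def split_by_song(csv_path, ratios=(0.8, 0.1, 0.1), seed=42):
    """Assign each song (and hence all of its clips) to one partition."""
    df = pd.read_csv(csv_path)
    songs = sorted(df["songID"].unique())  # assumed column name
    random.Random(seed).shuffle(songs)
    n_train = int(ratios[0] * len(songs))
    n_val = int(ratios[1] * len(songs))
    train = set(songs[:n_train])
    val = set(songs[n_train:n_train + n_val])
    # Every clip of a song lands in exactly one partition, so no song
    # appears in both train and test.
    part = df["songID"].map(
        lambda s: "train" if s in train else ("val" if s in val else "test")
    )
    return df.assign(split=part)

# splits = split_by_song("metadata_by_song.csv")
# splits.to_csv("split.csv", index=False)
```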
Cite this dataset
@inproceedings{EMOPIA,
  author = {Hung, Hsiao-Tzu and Ching, Joann and Doh, Seungheon and Kim, Nabin and Nam, Juhan and Yang, Yi-Hsuan},
  title = {{EMOPIA}: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation},
  booktitle = {Proc. Int. Society for Music Information Retrieval Conf.},
  year = {2021}
}
Files
- EMOPIA_1.0.zip (5.5 MB, md5:8f760ddcc014d144f1e2c5451bf003ac)