Published October 10, 2022
| Version 1.0
Dataset
Restricted
N20EM dataset for multimodal lyric transcription
Authors/Creators
- 1. National University of Singapore
Description
Note: To access the dataset, please submit the request at Zenodo page of the newer version of this dataset.
N20EM dataset for multimodal lyric transcription, proposed in our ACM MM 2022 paper, MM-ALT: A Multimodal Automatic Lyric Transcription System.
It contains recordings of three modalities: audio, video, and IMU motion signal.
Please cite our work as:
@inproceedings{gu2022mm,
title={MM-ALT: A multimodal automatic lyric transcription system},
author={Gu, Xiangming and Ou, Longshen and Ong, Danielle and Wang, Ye},
booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
pages={3328--3337},
year={2022}
}
Our paper's camera ready version: https://arxiv.org/abs/2207.06127
Project website: https://n20em.github.io/
Files
Additional details
References
- Gu, X., Ou, L., Ong, D. and Wang, Y., 2022, October. Mm-alt: A multimodal automatic lyric transcription system. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 3328-3337).