N20EM dataset for multimodal lyric transcription

Gu, Xiangming; Ou, Longshen; Ong, Danielle; Wang, Ye

doi:10.5281/zenodo.7545968

Published October 10, 2022 | Version 1.1

Dataset Open

1. National University of Singapore

The N20EM dataset for multimodal lyric transcription was proposed in our ACM MM 2022 paper, "MM-ALT: A Multimodal Automatic Lyric Transcription System". The dataset contains recordings in three modalities: audio, video, and IMU motion signals.

Camera-ready version of our paper: https://arxiv.org/abs/2207.06127

Project website: https://n20em.github.io/

Note:

By downloading the dataset, you confirm that you have read and agreed to the Terms and Conditions.
Commercial usage is strictly prohibited.

Please cite our work as:

@inproceedings{gu2022mm,
  title={MM-ALT: A multimodal automatic lyric transcription system},
  author={Gu, Xiangming and Ou, Longshen and Ong, Danielle and Wang, Ye},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  pages={3328--3337},
  year={2022}
}

Notes

Added data for VAD training and the accompaniment for each utterance.

Files

Files (5.5 GB)

Name	MD5	Size
accompaniment.zip	79640d19a13311ed853907d45b7b687d	1.4 GB
IMU_VAD_data.tar.gz	9cd08c1eb28017a28da17b3985f02f10	1.1 GB
n20em_v1.0.zip	6fb79f5ec7f8a2efd040c62a0419f1ce	3.0 GB
readme.txt	2594828ce15753cd3a508fe38029646e	245 Bytes
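
Because the archives are several gigabytes each, it may be worth checking a download against the MD5 digests listed above before unpacking. The following is a minimal sketch, not part of the dataset's own tooling: the helper names `md5_of_file` and `verify_downloads` are hypothetical, and the code assumes the files were saved under their listed names in a single directory.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing above.
EXPECTED_MD5 = {
    "accompaniment.zip": "79640d19a13311ed853907d45b7b687d",
    "IMU_VAD_data.tar.gz": "9cd08c1eb28017a28da17b3985f02f10",
    "n20em_v1.0.zip": "6fb79f5ec7f8a2efd040c62a0419f1ce",
    "readme.txt": "2594828ce15753cd3a508fe38029646e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(download_dir, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, want in expected.items():
        path = Path(download_dir) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == want:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify_downloads("path/to/downloads")` returns one status per listed file; a `mismatch` usually indicates an incomplete or corrupted download, so fetching that file again is the simplest remedy.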

Additional details

Is published in: Conference paper: 10.1145/3503161.3548411 (DOI)

Gu, X., Ou, L., Ong, D. and Wang, Y., 2022, October. MM-ALT: A multimodal automatic lyric transcription system. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 3328-3337).
