There is a newer version of the record available.

Published October 10, 2022 | Version 1.0
Dataset Restricted

N20EM dataset for multimodal lyric transcription

  • 1. National University of Singapore

Description

Note: To access the dataset, please submit the request at Zenodo page of the newer version of this dataset.

N20EM dataset for multimodal lyric transcription, proposed in our ACM MM 2022 paper, MM-ALT: A Multimodal Automatic Lyric Transcription System.

It contains recordings of three modalities: audio, video, and IMU motion signal. 

Please cite our work as:

@inproceedings{gu2022mm,
  title={MM-ALT: A multimodal automatic lyric transcription system},
  author={Gu, Xiangming and Ou, Longshen and Ong, Danielle and Wang, Ye},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  pages={3328--3337},
  year={2022}
}

Our paper's camera ready version: https://arxiv.org/abs/2207.06127

Project website: https://n20em.github.io/

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Hi! A newer version of the dataset has been published in this link:
https://zenodo.org/record/7545968
Please submit your access request in the newer page instead. We welcome you to follow our work!

You are currently not logged in. Do you have an account? Log in here

Additional details

References

  • Gu, X., Ou, L., Ong, D. and Wang, Y., 2022, October. Mm-alt: A multimodal automatic lyric transcription system. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 3328-3337).