N20EM dataset for multimodal lyric transcription

Gu, Xiangming; Ou, Longshen; Ong, Danielle; Wang, Ye

doi:10.5281/zenodo.6905332

Published October 10, 2022 | Version 1.0

Dataset Restricted

N20EM dataset for multimodal lyric transcription

1. National University of Singapore

Note: To access the dataset, please submit the request at Zenodo page of the newer version of this dataset.

N20EM dataset for multimodal lyric transcription, proposed in our ACM MM 2022 paper, MM-ALT: A Multimodal Automatic Lyric Transcription System.

It contains recordings of three modalities: audio, video, and IMU motion signal.

Please cite our work as:

@inproceedings{gu2022mm,
  title={MM-ALT: A multimodal automatic lyric transcription system},
  author={Gu, Xiangming and Ou, Longshen and Ong, Danielle and Wang, Ye},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  pages={3328--3337},
  year={2022}
}

Our paper's camera ready version: https://arxiv.org/abs/2207.06127

Project website: https://n20em.github.io/

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

Hi! A newer version of the dataset has been published in this link:
https://zenodo.org/record/7545968
Please submit your access request in the newer page instead. We welcome you to follow our work!

You are currently not logged in. Do you have an account? Log in here

Additional details

Gu, X., Ou, L., Ong, D. and Wang, Y., 2022, October. Mm-alt: A multimodal automatic lyric transcription system. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 3328-3337).

	All versions	This version
Views	1,320	673
Downloads	309	1
Data volume	562.4 GB	3.0 GB

N20EM dataset for multimodal lyric transcription

Authors/Creators

Description

Files

Restricted

Request access

Additional details

References