The DALI dataset
Description
The DALI dataset is a large dataset of synchronised audio, lyrics and notes: for each full-duration audio track it provides the time-aligned lyrics and the time-aligned notes of the vocal melody. Lyrics are described at four levels of granularity: notes (with the textual information underlying each note), words, lines and paragraphs. For each song, we also provide additional multimodal information such as genre, language, musician, album covers or links to video clips.
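The four granularity levels form a hierarchy: notes aggregate into words, words into lines, and lines into paragraphs. The sketch below illustrates this idea with a minimal, hypothetical annotation structure; the field names (`time`, `text`, `freq`) and the `merge` helper are illustrative assumptions, not the actual DALI schema (see the GitHub repository for the real format).

```python
from dataclasses import dataclass

# Hypothetical sketch of DALI-style hierarchical lyric annotations.
# Field names are illustrative, not the actual DALI schema.
@dataclass
class Annotation:
    time: tuple        # (start, end) in seconds
    text: str          # text underlying this unit
    freq: float = 0.0  # pitch in Hz (meaningful for notes only)

def merge(units):
    """Aggregate finer units (e.g. notes) into one coarser unit (e.g. a word)."""
    start = min(u.time[0] for u in units)
    end = max(u.time[1] for u in units)
    return Annotation((start, end), "".join(u.text for u in units))

# Two notes spelling the word "hel" + "lo", each with its own pitch
notes = [Annotation((0.0, 0.4), "hel", 220.0),
         Annotation((0.4, 0.9), "lo", 246.9)]
word = merge(notes)
print(word.time, word.text)  # (0.0, 0.9) hello
```

The same `merge` step can be applied again to build lines from words and paragraphs from lines, which is why a single time-span/text pair suffices at every level.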
Go to https://github.com/gabolsgabs/DALI, where you will find all the tools for working with the DALI dataset and a detailed description of how to use it.
For this version cite the article:
@article{meseguer2020creating,
  title={Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes},
  author={Meseguer-Brocal, Gabriel and Cohen-Hadria, Alice and Peeters, Geoffroy},
  journal={Transactions of the International Society for Music Information Retrieval},
  volume={3},
  number={1},
  year={2020},
  publisher={Ubiquity Press}
}
and the original paper:
@article{meseguer2019dali,
  title={DALI: A large dataset of synchronized audio, lyrics and notes, automatically created using teacher-student machine learning paradigm},
  author={Meseguer-Brocal, Gabriel and Cohen-Hadria, Alice and Peeters, Geoffroy},
  journal={arXiv preprint arXiv:1906.10606},
  year={2019}
}
This research has received funding from the French National Research Agency under contract ANR-16-CE23-0017-01 (WASABI project).