Dataset Open Access
The dataset contains lyrics for the songs in Arab-Anadalusian music collection curated within the CompMusic project, that belong to the nawbas "Isbahan", "Maya" and "Raml Maya".
Lyrics are stored in two formats: as Tab Separated Values (TSV) files and as JSON files.
Each file is identified by its MusicBrainz recording ID (MBID).
The lyrics are stored both in their original arabic script (folder 'original') and a romanized/transliterated version (folder 'transliterated') using the American Library of Congress (ALA-LC standard).
Corresponding audio files are available from the Arab-Andalusian music corpus, as well as the Internet Archive URL included in the metadata file ('metadata.tsv').
For more information, please refer to http://compmusic.upf.edu/corpora