There is a newer version of this record available.

Dataset Open Access

Arab-Andalusian music lyrics dataset

Sordo, Mohamed; Chaachoo, Mehdi; Serra, Xavier

The dataset contains lyrics for the songs in Arab-Anadalusian music collection curated within the CompMusic project, that belong to the nawbas "Isbahan", "Maya" and "Raml Maya".

Lyrics are stored in two formats: as Tab Separated Values (TSV) files and as JSON files.

Each file is identified by its MusicBrainz recording ID (MBID).

The lyrics are stored both in their original arabic script (folder 'original') and a romanized/transliterated version (folder 'transliterated') using the American Library of Congress (ALA-LC standard).

Corresponding audio files are available from the Arab-Andalusian music corpus, as well as the Internet Archive URL included in the metadata file ('metadata.tsv').

For more information, please refer to

Files (332.8 kB)
Name Size
332.8 kB Download
All versions This version
Views 488303
Downloads 5940
Data volume 32.7 MB13.3 MB
Unique views 414276
Unique downloads 5136


Cite as