Published August 7, 2016 | Version 1.0
Dataset Open

Indian Art Music Raga Recognition Dataset (features)

Description

The Rāga Recognition Datasets (features) comprise two sizable datasets, one for each music tradition: the Carnatic Music Dataset (CMD) and the Hindustani Music Dataset (HMD). Each dataset entry includes features such as pitch, tonic, and nyas and tani segments. These datasets can be used to develop and evaluate approaches for automatic rāga recognition in Indian art music. To the best of our knowledge, they are the largest and most comprehensive datasets (in terms of available metadata) ever used for studying this task.

This repository only contains the metadata and computed features for the dataset, and shared in open access. To get the audio, please refer to this zenodo entry and submit your request.

 

Please cite the following publications if you use the material shared here in your research work.

Gulati, S., Serrà, J., Ganguli, K. K., ¸Sentürk, S., & Serra, X. (2016). Time-delayed melody surfaces for raga recognition. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 751–757. New York, USA. [Postprint PDF]

Gulati, S., Serrà, J., Ishwar, V., ¸Sentürk, S., & Serra, X. (2016). Phrase-based raga recognition using vector space modeling. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 66–70. Shanghai, China. [Postprint PDF]

 

Annotation Format

We provide both tsv files and json files that contain information about each audio recording in terms of its mbid, the path of the audio/feature files and the associated rāga identifier. Each rāga is assigned a unique identifier by Dunya, which is similar to the mbid in terms of purpose. We also provide a mapping of the rāga id to its transliterated name.

Mirdata

This dataset is included in mirdata. Use the following code snippet to access the dataset in mirdata.

# Import midata
import mirdata

# Initialize dataset
dataset_name = 'compmusic_raga'
data_home = 'mirdata/dataset'
dataset = mirdata.initialize(dataset_name, data_home=data_home)

# Download dataset
dataset.download()

# Validate dataset
dataset.validate()

# Load dataset as a dictionary with track ids as keys and track objects as values
data = dataset.load_tracks()

In order to load the audio files in mirdata, they must be requested beforehand and placed in the data home directory.

Contact 

If you have any questions or comments about the dataset, please feel free to email:

mtg-info@upf.edu

 

Files

Indian Art Music Raga Recognition Dataset (features).zip

Files (3.6 GB)

Additional details

Funding

European Commission
COMPMUSIC - Computational models for the discovery of the world's music 267583

References

  • Gulati, S., Serrà, J., Ganguli, K. K., ¸Sentürk, S., & Serra, X. (2016). Time-delayed melody surfaces for raga recognition. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 751–757. New York, USA.
  • Gulati, S., Serrà, J., Ishwar, V., ¸Sentürk, S., & Serra, X. (2016). Phrase-based raga recognition using vector space modeling. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 66–70. Shanghai, China.