Indian Art Music Melodic Similarity Dataset

Gulati, Sankalp; Serrà, Joan; Serra, Xavier

doi:10.5281/zenodo.16631794

Published April 19, 2015 | Version v2

Dataset Open

Indian Art Music Melodic Similarity Dataset

1. Pompeu Fabra University

This dataset comprises audio excerpts and manually done annotations of the melodic phrases in Carnatic and Hindustani music. This dataset can be used to develop and evaluate approaches for computing melodic similarity between short-time melodic patterns in Indian art music. This dataset is divided into two parts, one for Carnatic music (CMD), and the other for Hindustani music (HMD).

There are two versions of the dataset available:

Original version

These two datasets, CMD and HMD are compiled originally by the authors of iswar2013 and ross2012, respectively. Though, they have evolved over time and have been recompiled along with the extracted audio features.

Please cite if you use the material shared here in your research work.

Gulati, S., Serrà, J., & Serra, X. (2015). An evaluation of methodologies for melodic similarity in audio recordings of Indian art music. In Proceedings of the 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 678–682. Brisbane, Australia.

[Postprint PDF@MTG]

Improved version

It was found that several instances of melodic phrases were not marked in the annotations. The missing phrases have been added in the improved version of the dataset.

Please cite the following publication if you use the material shared here in your research work.

Gulati, S., Serrà, J., & Serra, X. (2015). Improving melodic similarity in Indian art music using culture-specific melodic characteristics. In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR), pp. 680–686. Málaga, Spain.

[Postprint PDF@MTG]

Dataset structure

The dataset is divided into two parts, Carnatic and Hindustani.

Carnatic has 23 folders for each song. In each folder, there are the following files named:

<song identifier>.mp3: Performance audio.

<song identifier>.anot: Contains the original annotations.

<song identifier>.anotEdit1: Contains the improved annotations.

<song identifier>.flatSegNyas: Contains nyas annotations.

<song identifier>.pitch: Contains original pitch annotations.

<song identifier>.pitchSilIntrpPP: Contains improved pitch annotations.

<song identifier>.tonic: Tonic of the performance.

<song identifier>.tonicFine: Finetuned tonic of the performance.

Hindustani has 9 folders for each song. In each folder, there are the following files named:

<song identifier>.wav: Performance audio.

<song identifier>.anot: Contains the original annotations.

<song identifier>.anotEdit4: Contains the improved annotations.

<song identifier>.flatSegNyas: Contains nyas annotations.

<song identifier>.tpe: Contains original pitch annotations.

<song identifier>.tpe5msSilIntrpPP: Contains improved pitch annotations.

<song identifier>.tonic: Tonic of the performance.

<song identifier>.tonicFine: Finetuned tonic of the performance.

Annotation file contains tab separated values with format as:

<start_time><tab><end_time><tab><id of the melodic phrase>

Mirdata

This dataset is included in mirdata. Use the following code snippet to access the dataset in mirdata.

# Import midata
import mirdata

# Initialize dataset
dataset_name = 'compmusic_iamms'
data_home = 'mirdata/dataset'
dataset = mirdata.initialize(dataset_name, data_home=data_home)

# Download dataset
dataset.download()

# Validate dataset
dataset.validate()

# Load dataset as a dictionary with track ids as keys and track objects as values
data = dataset.load_tracks()

Contact

If you have any questions or comments about the dataset, please feel free to email:

mtg-info@upf.edu

Files

MelodicSimilarityDataset.zip

Files (400.5 MB)

Name	Size	Download all
MelodicSimilarityDataset.zip md5:d02c3f329558f91de2fe3bd613f6f2f5	400.5 MB	Preview Download

Additional details

European Commission
COMPMUSIC - Computational models for the discovery of the world's music 267583

Gulati, S., Serrà, J., & Serra, X. (2015). An evaluation of methodologies for melodic similarity in audio recordings of Indian art music. In Proceedings of the 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 678–682. Brisbane, Australia.
Gulati, S., Serrà, J., & Serra, X. (2015). Improving melodic similarity in Indian art music using culture-specific melodic characteristics. In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR), pp. 680–686. Málaga, Spain.
Ishwar, V., Dutta, S., Bellur, A., & Murthy, H. (2013). Motif spotting in an Alapana in Carnatic music. In Proc. of Int. Conf. on Music Information Retrieval (ISMIR), pp. 499–504. Ross, J. C., Vinutha, T. P., & Rao, P. (2012). Detecting melodic motifs from audio for Hindustani classical music. In Proc. of Int. Conf. on Music Information Retrieval (ISMIR), pp. 193–198.
Ross, J. C., Vinutha, T. P., & Rao, P. (2012). Detecting melodic motifs from audio for Hindustani classical music. In Proc. of Int. Conf. on Music Information Retrieval (ISMIR), pp. 193–198.

	All versions	This version
Views	442	318
Downloads	149	93
Data volume	65.3 GB	42.1 GB

Original version

Improved version

Dataset structure

Mirdata

Contact

MelodicSimilarityDataset.zip

Files (400.5 MB)

Funding

References

Indian Art Music Melodic Similarity Dataset

Authors/Creators

Description

Original version

Improved version

Dataset structure

Mirdata

Contact

Files

MelodicSimilarityDataset.zip

Files (400.5 MB)

Additional details

Funding

References