Published February 5, 2014 | Version 1.0
Dataset Restricted

Carnatic Music Rhythm Dataset

  • 1. Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain
  • 2. Boğaziçi University
  • 3. Bogaziçi University, Department of Computer Engineering

Description

CompMusic Carnatic Rhythm Dataset is a rhythm annotated test corpus for automatic rhythm analysis tasks in Carnatic Music. The collection consists of audio excerpts from the CompMusic Carnatic research corpus, manually annotated time aligned markers indicating the progression through the taala cycle, and the associated taala related metadata. A brief description of the dataset is provided below. For a brief overview and audio examples of taalas in Carnatic music, please see http://compmusic.upf.edu/examples-taala-carnatic

THE DATASET

Audio music content 

The pieces are chosen from the CompMusic Carnatic music collection. The pieces were chosen in four popular taalas of Carnatic music, which encompasses a majority of Carnatic music. The pieces were chosen include a mix of vocal and instrumental recordings, new and old recordings, and to span a wide variety of forms. All pieces have a percussion accompaniment, predominantly Mridangam. The excerpts are full length pieces or a part of the full length pieces. There are also several different pieces by the same artist (or release group), and multiple instances of the same composition rendered by different artists. Each piece is uniquely identified using the MBID of the recording. The pieces are stereo, 160 kbps, mp3 files sampled at 44.1 kHz.

Annotations

There are several annotations that accompany each excerpt in the dataset.

Sama and beats: The primary annotations are audio synchronized time-stamps indicating the different metrical positions in the taala cycle. The annotations were created using Sonic Visualizer by tapping to music and manually correcting the taps. Each annotation has a time-stamp and an associated numeric label that indicates the position of the beat marker in the taala cycle. The marked positions in the taala cycle are shown with numbers, along with the corresponding label used. In each case, the sama (the start of the cycle, analogous to the downbeat) are indicated using the numeral 1.

Taala related metadata: For each excerpt, the taala of the piece, edupu (offset of the start of the piece, relative to the sama, measured in aksharas) of the composition, and the kalai (the cycle length scaling factor) are recorded. Each excerpt can be uniquely identified and located with the MBID of the recording, and the relative start and end times of the excerpt within the whole recording. A separate 5 digit taala based unique ID is also provided for each excerpt as a double check. The artist, release, the lead instrument, and the raaga of the piece are additional editorial metadata obtained from the release. A flag indicates if the excerpt is a full piece or only a part of a full piece. There are optional comments on audio quality and annotation specifics. 

Possible uses of the dataset

Possible tasks where the dataset can be used include taala, sama and beat tracking, tempo estimation and tracking, taala recognition, rhythm based segmentation of musical audio, structural segmentation, audio to score/lyrics alignment, and rhythmic pattern discovery.

Dataset organization

The dataset consists of audio, annotations, an accompanying spreadsheet providing additional metadata. For a detailed description of the organization, please see the README in the dataset.

Data Subset

A subset of this dataset consisting of 118 two minute excerpts of music is also available. The content in the subset is equaivalent and is separately distributed for a quicker testing of algorithms and approaches.

Using this dataset

Please cite the following publications if you use the dataset in your work:

Srinivasamurthy, A., Holzapfel, A., Cemgil, A. T., & Serra, X. (2015, October). Particle Filters for Efficient Meter Tracking with Dynamic Bayesian Networks. In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015) (pp. 197–203). Malaga, Spain. (Subset)

http://hdl.handle.net/10230/34998 

Srinivasamurthy, A., & Serra, X. (2014, May). A Supervised Approach to Hierarchical Metrical Cycle Tracking from Audio Music Recordings. In Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) (pp. 5237–5241). Florence, Italy. (Full dataset)

http://doi.org/10.1109/ICASSP.2014.6854598

We are interested in knowing if you find our datasets useful! If you use our dataset please email us at mtg-info@upf.edu and tell us about your research.

Contact

If you have any questions or comments about the dataset, please feel free to write to us.

Ajay Srinivasamurthy

Music Technology Group

Universitat Pompeu Fabra, 

Barcelona, Spain

ajays.murthy@upf.edu

 

http://compmusic.upf.edu/carnatic-rhythm-dataset

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

The audio in the dataset is copyrighted material sourced from commercially available music releases. Please use it only for non-commercial research purposes and do not distribute it further.

The annotations are released under Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

https://creativecommons.org/licenses/by-nc-nd/4.0/

Please include in the justification field your academic affiliation (if you have one) and a brief description of your research topics and why you would like to use this dataset. If you do not include this information we may not approve your request.

For further details, please contact

Ajay Srinivasamurthy
ajays.murthy@upf.edu

You are currently not logged in. Do you have an account? Log in here

Additional details

Related works

References

  • Srinivasamurthy, A., Holzapfel, A., Cemgil, A. T., & Serra, X. (2015, October). Particle Filters for Efficient Meter Tracking with Dynamic Bayesian Networks. In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015) (pp. 197–203). Malaga, Spain. (Subset)
  • Srinivasamurthy, A., & Serra, X. (2014, May). A Supervised Approach to Hierarchical Metrical Cycle Tracking from Audio Music Recordings. In Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) (pp. 5237–5241). Florence, Italy. (Full dataset)