Dataset Open Access
Akshay Anantapadmanabhan; Ashwin Bellur; Hema A. Murthy
{ "files": [ { "links": { "self": "https://zenodo.org/api/files/9a16f90a-5148-4256-9409-5f760d153431/mridangam_stroke_1.5.zip" }, "checksum": "md5:39af55b2476b94c7946bec24331ec01a", "bucket": "9a16f90a-5148-4256-9409-5f760d153431", "key": "mridangam_stroke_1.5.zip", "type": "zip", "size": 130280712 } ], "owners": [ 46759 ], "doi": "10.5281/zenodo.4068196", "stats": { "version_unique_downloads": 680.0, "unique_views": 226.0, "views": 303.0, "version_views": 982.0, "unique_downloads": 215.0, "version_unique_views": 818.0, "volume": 39084213600.0, "version_downloads": 936.0, "downloads": 300.0, "version_volume": 130102885920.0 }, "links": { "doi": "https://doi.org/10.5281/zenodo.4068196", "conceptdoi": "https://doi.org/10.5281/zenodo.1265187", "bucket": "https://zenodo.org/api/files/9a16f90a-5148-4256-9409-5f760d153431", "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1265187.svg", "html": "https://zenodo.org/record/4068196", "latest_html": "https://zenodo.org/record/4068196", "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.4068196.svg", "latest": "https://zenodo.org/api/records/4068196" }, "conceptdoi": "10.5281/zenodo.1265187", "created": "2020-10-06T12:22:00.624775+00:00", "updated": "2020-10-16T09:25:43.695482+00:00", "conceptrecid": "1265187", "revision": 6, "id": 4068196, "metadata": { "access_right_category": "success", "doi": "10.5281/zenodo.4068196", "description": "<p>The Mridangam Stroke dataset is a collection of 6977 audio examples of individual strokes of the Mridangam in various tonics. The dataset comprises of 10 different strokes played on Mridangams with 6 different tonic values.</p>\n\n<p>This is an updated version of the dataset, as the original version 1.0 presents some silent or wrong annotated tracks.</p>\n\n<p><strong>Audio content</strong></p>\n\n<p>The dataset provides audio examples for each of the strokes. There are six different tonics and ten different stroke labels.</p>\n\n<p>The audio examples were recorded from a professional Carnatic percussionist in a semi-anechoic studio conditions by Akshay Anantapadmanabhan using SM-58 microphones and an H4n ZOOM recorder. The audio was sampled at 44.1 kHz and stored as 16 bit wav files. The dataset can be used for training models for each Mridangam stroke. </p>\n\n<p><strong>Metadata</strong></p>\n\n<p>The whole dataset is organized by the tonic, into 6 packs. Each audio file is named as, </p>\n\n<pre><code><StrokeName>_<Tonic>_<InstanceNumber>.wav\n<Tonic> = {B, C, Csh, D, Dsh, E}\n<StrokeName> = {Bheem, Cha, Dheem, Dhin, Num, Ta, Tha, Tham, Thi, Thom}</code></pre>\n\n<p><strong>Using this dataset</strong></p>\n\n<p>A detailed description of the Mridangam and its strokes can be found in the following paper. A part of the dataset was used in the paper. Please cite it if you use the dataset in your work.</p>\n\n<blockquote>\n<p>Akshay Anantapadmanabhan, Ashwin Bellur, Hema A. Murthy, "Modal analysis and transcription of strokes of the mridangam using non-negative matrix factorization," in Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), pp.181-185, May 2013</p>\n</blockquote>\n\n<p><a href=\"http://hdl.handle.net/10230/25756\">http://hdl.handle.net/10230/25756</a></p>\n\n<p>We are interested in knowing if you find our datasets useful! If you use our dataset please email us at <a href=\"mailto:mtg-info@upf.edu\">mtg-info@upf.edu</a> and tell us about your research.</p>\n\n<p><strong>Contact</strong></p>\n\n<p>If you have any questions or comments about the dataset, please feel free to write to us: </p>\n\n<p>Akshay Anantapadmanabhan (akshay.anantapadmanabhan@gmail.com)</p>\n\n<p> </p>\n\n<p><a href=\"http://compmusic.upf.edu/mridangam-stroke-dataset\">http://compmusic.upf.edu/mridangam-stroke-dataset</a></p>", "contributors": [], "title": "Mridangam Stroke Dataset", "license": { "id": "CC-BY-NC-4.0" }, "relations": { "version": [ { "count": 2, "index": 1, "parent": { "pid_type": "recid", "pid_value": "1265187" }, "is_last": true, "last_child": { "pid_type": "recid", "pid_value": "4068196" } } ] }, "language": "eng", "version": "1.5", "references": [ "Akshay Anantapadmanabhan, Ashwin Bellur, Hema A. Murthy, \"Modal analysis and transcription of strokes of the mridangam using non-negative matrix factorization,\" in Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), pp.181-185, May 2013" ], "communities": [ { "id": "compmusic" }, { "id": "mdm-dtic-upf" }, { "id": "mtgupf" } ], "publication_date": "2020-10-06", "creators": [ { "affiliation": "Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India", "name": "Akshay Anantapadmanabhan" }, { "affiliation": "Department of Electrical Engineering, Indian Institute of Technology, Madras, India", "name": "Ashwin Bellur" }, { "affiliation": "Department of Computer Science and Engineering, Indian Institute of Technology, Madras, Indiaat Pompeu Fabra", "name": "Hema A. Murthy" } ], "meeting": { "url": "https://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=6619549", "dates": "26-31 May 2013", "place": "Vancouver, BC, Canada", "title": "2013 IEEE International Conference on Acoustics, Speech and Signal Processing" }, "access_right": "open", "resource_type": { "type": "dataset", "title": "Dataset" }, "related_identifiers": [ { "scheme": "doi", "identifier": "10.1109/ICASSP.2013.6637633", "relation": "cites" }, { "scheme": "doi", "identifier": "10.5281/zenodo.1265187", "relation": "isVersionOf" } ] } }
All versions | This version | |
---|---|---|
Views | 982 | 303 |
Downloads | 936 | 300 |
Data volume | 130.1 GB | 39.1 GB |
Unique views | 818 | 226 |
Unique downloads | 680 | 215 |