There is a newer version of this record available.

Dataset Open Access

TinySOL: an audio dataset of isolated musical notes

Carmine-Emanuele Cella; Daniele Ghisi; Vincent Lostanlen; Fabien Lévy; Joshua Fineberg; Yan Maresz


JSON-LD (schema.org) Export

{
  "description": "<p>TinySOL<br>\n=======<br>\nVersion 4.0, January 2020.<br>\n&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p>Created By<br>\n--------------</p>\n\n<p>Carmine-Emanuele Cella (1), Daniele Ghisi (1), Vincent Lostanlen (2), Fabien L&eacute;vy (3), Joshua Fineberg (4), Yan Maresz (5)<br>\n<br>\n(1): UC Berkeley<br>\n(2): New York University<br>\n(3): Columbia University<br>\n(4): Boston University<br>\n(5): Conservatoire de Paris</p>\n\n<p>&nbsp;</p>\n\n<p>Description<br>\n---------------</p>\n\n<p><br>\nTinySOL is a dataset of 2478 samples, each containing a single musical note from one of 14 different instruments:</p>\n\n<ol>\n\t<li>Bass Tuba</li>\n\t<li>French Horn</li>\n\t<li>Trombone</li>\n\t<li>Trumpet in C</li>\n\t<li>Accordion</li>\n\t<li>Contrabass</li>\n\t<li>Violin</li>\n\t<li>Viola</li>\n\t<li>Violoncello</li>\n\t<li>Bassoon</li>\n\t<li>Clarinet in B-flat</li>\n\t<li>Flute</li>\n\t<li>Oboe</li>\n\t<li>Alto Saxophone</li>\n</ol>\n\n<p>&nbsp;</p>\n\n<p>These sounds were originally recorded at Ircam in Paris (France) between 1996 and 1999, as part of a larger project named Studio On Line (SOL). Although SOL contains many combinations of mutes and extended playing techniques, TinySOL consists purely of sounds played in the so-called &quot;ordinary&quot; style, without a mute.<br>\n<br>\nTinySOL can be used for creative purposes insofar as the use complies with the Creative Commons Attribution 4.0 International license (see below).<br>\n<br>\nTinySOL can be used for education and research purposes. In particular, it can be employed as a dataset for training and/or evaluating music information retrieval (MIR) systems, for tasks such as instrument recognition or fundamental frequency estimation. For this purpose, we provide an official 5-fold split of TinySOL. This split has been carefully balanced in terms of instrumentation, pitch range, and dynamics. 
For the sake of research reproducibility, we encourage users of TinySOL to adopt this split and report their results in terms of average performance across folds.</p>\n\n<p>&nbsp;</p>\n\n<p>Data Files<br>\n--------------</p>\n\n<p>TinySOL contains 2478 audio clips as WAV files, sampled at 44.1&nbsp;kHz, with a single channel (mono), at a bit depth of 16. This is equivalent to the audio quality of a compact disc. Audio clips vary in duration between two and ten seconds.</p>\n\n<p>Every audio file has a file path of the form:<br>\n&lt;FAMILY&gt;/&lt;INSTRUMENT&gt;/ordinario/&lt;INSTR&gt;-ord-&lt;PITCH&gt;-&lt;DYNAMICS&gt;-&lt;MISC&gt;.wav</p>\n\n<p><br>\nwhere:</p>\n\n<ul>\n\t<li>&lt;FAMILY&gt; corresponds to the instrument family: &quot;Brass&quot;, &quot;Keyboards&quot; (includes accordion), &quot;Strings&quot;, and &quot;Winds&quot; (i.e., woodwinds).</li>\n\t<li>&lt;INSTRUMENT&gt; is the full name of the instrument.</li>\n\t<li>&lt;INSTR&gt; is the abbreviated name of the instrument (e.g., &quot;Vn&quot; for violin).</li>\n\t<li>&quot;ordinario&quot; denotes the ordinary playing technique. This is in contrast with the rest of the SOL dataset, which also encompasses extended playing techniques.</li>\n\t<li>&lt;PITCH&gt; denotes the pitch of the musical note. This pitch is encoded in the American standard pitch notation: pitch class (C means &quot;do&quot;) followed by pitch octave. According to this convention, A4 has a fundamental frequency of 440 Hz.</li>\n\t<li>&lt;DYNAMICS&gt; denotes the intensity dynamics, ranked from pp (pianissimo) to ff (fortissimo).</li>\n\t<li>&lt;MISC&gt; contains additional information, when applicable. For example, for bowed string instruments, the same pitch may sometimes be achieved in different positions and on different strings, resulting in small timbre differences. In this case the label &quot;1c&quot;, &quot;2c&quot;, &quot;3c&quot;, or &quot;4c&quot; denotes the string which is being bowed. (The letter c originates from the word &quot;corde&quot;, which means string in French.) 
By convention, the first string is the one with the highest pitch when played as an open string. Furthermore, some pitches were never recorded, and are thus missing from the chromatic scale. In this case, the &lt;MISC&gt; tag contains a letter &quot;R&quot;, to denote the fact that the corresponding WAV file has been obtained by transforming a different audio clip via some digital frequency transposition. The letter &quot;R&quot; stands for &quot;resampled&quot;. If none of these tags apply, the &lt;MISC&gt; field becomes &quot;N&quot;, which stands for &quot;None&quot;.</li>\n</ul>\n\n<p>For example, &quot;Strings/Violin/ordinario/Vn-ord-G#6-mf-1cR.wav&quot; corresponds to:</p>\n\n<ul>\n\t<li>a violin sound;</li>\n\t<li>played in the ordinary playing technique;</li>\n\t<li>at pitch G#6 (approximately 1661 Hz);</li>\n\t<li>with mezzoforte dynamics;</li>\n\t<li>on the first string; and</li>\n\t<li>resampled from a different sound, as opposed to genuinely recorded.</li>\n</ul>\n\n<p>&nbsp;</p>\n\n<p>Metadata File<br>\n-------------------</p>\n\n<p>The TinySOL_metadata.csv file contains 2478 rows, one for each audio clip. It can be opened with a text editor or a spreadsheet application. It contains 13 columns:</p>\n\n<ol>\n\t<li>Path to the WAV file, in UNIX filesystem format. For Windows compatibility, replace the slashes (&quot;/&quot;) with backslashes (&quot;\\&quot;). Ex: &quot;Brass/BTb/BTb-ord-A#1-ff-N.wav&quot;</li>\n\t<li>Fold ID. Either equal to 0, 1, 2, 3, or 4.</li>\n\t<li>Family. Ex: &quot;Brass&quot;</li>\n\t<li>Instrument abbreviation. Ex: &quot;BTb&quot;</li>\n\t<li>Instrument name in full. Ex: &quot;Bass Tuba&quot;</li>\n\t<li>Technique abbreviation. Always equal to &quot;ord&quot; in the case of TinySOL.</li>\n\t<li>Technique name in full. Always equal to &quot;ordinario&quot; in the case of TinySOL.</li>\n\t<li>Pitch. Ex: &quot;A#1&quot;</li>\n\t<li>Pitch ID in MIDI format. Ex: 34. Integer in the range 0-127.</li>\n\t<li>Dynamics. 
Ex: &quot;ff&quot;.</li>\n\t<li>Dynamics ID. Integer. pp maps to 0 and ff maps to 4. The higher, the louder.</li>\n\t<li>Resampled. True if the file has been pitch-shifted; False otherwise.</li>\n\t<li>String ID. Equal to 1, 2, 3, 4, or empty if not applicable.</li>\n</ol>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p>Conditions of Use<br>\n------------------------</p>\n\n<p>TinySOL was created in 2020 by Carmine-Emanuele Cella, Daniele Ghisi, Vincent Lostanlen, Fabien L&eacute;vy, Joshua Fineberg, and Yan Maresz.</p>\n\n<p>TinySOL is a derivative of SOL. We wish to thank Hugues Vinet and all coordinators of the Ircam Forum for their authorization to upload TinySOL to Zenodo.</p>\n\n<p>TinySOL is offered free of charge under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) license:<br>\nhttps://creativecommons.org/licenses/by/4.0/</p>\n\n<p>The dataset and its contents are made available on an &quot;as is&quot; basis and without warranties of any kind, including without limitation satisfactory quality and conformity, merchantability, fitness for a particular purpose, accuracy or&nbsp;completeness, or absence of errors. Subject to any liability that may not be excluded or limited by law, the authors are&nbsp;not liable for, and expressly exclude&nbsp;all liability for, loss or damage however and whenever caused to anyone by any use of the TinySOL dataset or any part of it.</p>\n\n<p>We encourage TinySOL users to subscribe to the Ircam Forum so that they can have access to larger versions of SOL. While downloading the full version of SOL requires premium membership (for a yearly fee), a medium-sized version named OrchideaSOL is made available free of charge to all members. Note, however, that TinySOL is the only subset of SOL which is released under a Creative Commons License. 
For more information, please visit: https://forum.ircam.fr/</p>\n\n<p>&nbsp;</p>\n\n<p>Versions<br>\n-----------<br>\n1.0 was released on January 31st, 2020.<br>\n2.0 and 3.0 were released the same day, after fixing an issue in the metadata related to file paths.<br>\n4.0 was released on February 7th, 2020. The file structure of the tar.gz file was simplified so as to improve the interoperability with the mirdata Python package.</p>\n\n<p>&nbsp;</p>\n\n<p>Feedback<br>\n-------------</p>\n\n<p>Please help us improve TinySOL by sending your feedback to:<br>\ncarmine.cella@berkeley.edu</p>\n\n<p>For issues regarding the metadata encoding, the five-fold split, or the TinySOL module in mirdata, please write to:<br>\nvincent.lostanlen@nyu.edu</p>\n\n<p>In case of a problem, please include as many details as possible.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "@type": "Person", 
      "name": "Carmine-Emanuele Cella"
    }, 
    {
      "@type": "Person", 
      "name": "Daniele Ghisi"
    }, 
    {
      "@id": "https://orcid.org/0000-0003-0580-1651", 
      "@type": "Person", 
      "name": "Vincent Lostanlen"
    }, 
    {
      "@type": "Person", 
      "name": "Fabien L\u00e9vy"
    }, 
    {
      "@type": "Person", 
      "name": "Joshua Fineberg"
    }, 
    {
      "@type": "Person", 
      "name": "Yan Maresz"
    }
  ], 
  "url": "https://zenodo.org/record/3659365", 
  "datePublished": "2020-01-31", 
  "version": "4.0", 
  "keywords": [
    "music information retrieval", 
    "computer music", 
    "audio signal processing", 
    "music cognition"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/56385b50-d096-4308-b546-82e93d888b15/TinySOL_metadata.csv", 
      "encodingFormat": "csv", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/56385b50-d096-4308-b546-82e93d888b15/TinySOL.tar.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.3659365", 
  "@id": "https://doi.org/10.5281/zenodo.3659365", 
  "@type": "Dataset", 
  "name": "TinySOL: an audio dataset of isolated musical notes"
}
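
The file naming convention described above can be unpacked programmatically. The following is a minimal sketch in Python, not part of the official TinySOL tooling: the function name parse_tinysol_path, the regular expression, and the returned field names are all hypothetical, and the sketch assumes that accidentals are only ever written as sharps ("#"), as in the examples given in the description. The MIDI number and frequency follow from the stated convention that A4 (MIDI pitch 69) has a fundamental frequency of 440 Hz.

```python
import posixpath
import re

# Semitone offsets of the natural pitch classes relative to C.
SEMITONES = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}

# Hypothetical pattern for <INSTR>-ord-<PITCH>-<DYNAMICS>-<MISC>.wav
# (assumption: sharps only, no flats in <PITCH>).
NAME_RE = re.compile(
    r"^(?P<instr>[A-Za-z]+)-ord-"
    r"(?P<letter>[A-G])(?P<sharp>#?)(?P<octave>\d+)-"
    r"(?P<dynamics>[a-z]+)-"
    r"(?P<misc>[0-9A-Za-z]+)\.wav$"
)


def parse_tinysol_path(path):
    """Split a UNIX-style TinySOL path into its documented fields."""
    match = NAME_RE.match(posixpath.basename(path))
    if match is None:
        raise ValueError(f"unexpected file name: {path!r}")

    # MIDI convention: C4 is 60, hence A4 is 69.
    midi = (12 * (int(match["octave"]) + 1)
            + SEMITONES[match["letter"]]
            + (1 if match["sharp"] else 0))

    misc = match["misc"]
    # "1c" to "4c" names the bowed string; a trailing "R" means resampled;
    # a bare "N" means that neither tag applies.
    string_id = int(misc[0]) if re.match(r"[1-4]c", misc) else None

    return {
        "instrument": match["instr"],
        "pitch": match["letter"] + match["sharp"] + match["octave"],
        "midi": midi,
        "frequency_hz": 440.0 * 2.0 ** ((midi - 69) / 12),
        "dynamics": match["dynamics"],
        "string_id": string_id,
        "resampled": misc.endswith("R"),
    }


if __name__ == "__main__":
    print(parse_tinysol_path("Strings/Violin/ordinario/Vn-ord-G#6-mf-1cR.wav"))
```

For the violin example from the description, this yields MIDI pitch 92 and a frequency of roughly 1661 Hz, matching the documented value. The fields recovered here overlap with the columns of TinySOL_metadata.csv, which remains the authoritative source.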