Dataset Open Access

Good-sounds dataset

Romani Picas, Oriol; Parra Rodriguez, Hector; Dabiri, Dara; Serra, Xavier


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">sound</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">goodness</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">timbre</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">instrument</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">MTG</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">good-sounds</subfield>
  </datafield>
  <controlfield tag="005">20190410041504.0</controlfield>
  <controlfield tag="001">820937</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">7 May 2015</subfield>
    <subfield code="g">AES</subfield>
    <subfield code="a">138th Audio Engineering Society Convention</subfield>
    <subfield code="c">Warsaw</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield>
    <subfield code="a">Parra Rodriguez, Hector</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield>
    <subfield code="a">Dabiri, Dara</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield>
    <subfield code="a">Serra, Xavier</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">13906331564</subfield>
    <subfield code="z">md5:2137bbb2d32c1d60aa51e1301225f541</subfield>
    <subfield code="u">https://zenodo.org/record/820937/files/good-sounds.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u">http://www.aes.org/events/138/</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2017-06-29</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:820937</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield>
    <subfield code="a">Romani Picas, Oriol</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Good-sounds dataset</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;&lt;strong&gt;General description:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This dataset was created in the context of the Pablo project, partially funded by KORG Inc. It contains monophonic recordings of two kind of exercises: single notes and scales. The dataset was reported in the following article:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Romaní Picas O., Parra Rodriguez H., Dabiri D., Tokuda H., Hariya W., Oishi K., &amp;amp; Serra X."A real-time system for measuring sound goodness in instrumental sounds", 138th Audio Engineering Society Convention (2015). &lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The recordings were made in the Universitat Pompeu Fabra / Phonos recording studio by 15 different professional musicians, all of them holding a music degree and having some expertise in teaching. 12 different instruments were recorded using one or up to 4 different microphones (depending on the recording session). For all the instruments the whole set of playable semitones in the instrument is recorded several times with different tonal characteristics. Each note is recorded into a separate mono .flac audio file of 48kHz and 32 bits. The tonal characteristics are explained both in the the following section and the related publication.&lt;/p&gt;

&lt;p&gt;The audio files are organised in one directory for each recording session. In addition to the files, one SQLite database file is included. The structure of the database is related in the following section.&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Database description:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The database is meant for organizing the sounds in a handy way. It is organised in four different tables: sounds, takes, packs and ratings.&lt;/p&gt;

&lt;p&gt;Sounds&lt;/p&gt;

&lt;p&gt;The table containing the sounds annotations.&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;id&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;instrument &lt;/strong&gt;: flute, cello, clarinet, trumpet, violin, sax_alto, sax_tenor, sax_baritone, sax_soprano, oboe, piccolo, bass&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;note&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;octave&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;dynamics &lt;/strong&gt;: for some sounds, the musical notation of the loudness level (p, mf, f..)&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;recorded_at &lt;/strong&gt;: recording date and time&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;location &lt;/strong&gt;: recording place&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;player &lt;/strong&gt;: the musician who recorded it. For detailed information about the musicians please contact us.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;bow_velocity &lt;/strong&gt;: for some string instruments the velocity of the bow (slow, medieum, fast)&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;bridge_position &lt;/strong&gt;: for some string instruments the position of the bow (tasto, middle, ponticello)&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;string &lt;/strong&gt;: for some string instruments the number of the string in which the sound it's played (1: lowest in pitch)&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;csv_file &lt;/strong&gt;: used for creation of the DB&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;csv_id &lt;/strong&gt;: used for creation of the DB&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;pack_filename &lt;/strong&gt;: used for creation of the DB&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;pack_id &lt;/strong&gt;: used for creation of the DB&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;attack &lt;/strong&gt;: for single notes, manual annotation of the onset in samples.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;decay &lt;/strong&gt;: for single notes, manual annotation of the decay in samples.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;sustain &lt;/strong&gt;: for single notes, manual annotation of the beginnig of the sustained part in samples.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;release &lt;/strong&gt;: for single notes, manual annotation of the beginnig of the release part in samples.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;offset &lt;/strong&gt;: for single notes, manual annotation of the offset in samples&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;reference &lt;/strong&gt;: 1 if sound was used to create the models in the good-sounds project, 0 if not.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;klass &lt;/strong&gt;: user generated tags of the tonal qualities of the sound. They also contain information about the exercise, that could be single note or scale.
	&lt;ul&gt;
		&lt;li&gt;"good-sound": good examples of single note&lt;/li&gt;
		&lt;li&gt;"bad": bad example of one of the sound attributes defined in the project (please read the papers for a detailed explanation)&lt;/li&gt;
		&lt;li&gt;"scale-good": good example of a one octave scale going up and down (15 notes). If the scale is minor a tagged "minor" is also available.&lt;/li&gt;
		&lt;li&gt;"scale-bad": bad example scale of one of the sounds defined in the project. (15 notes up and down).&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;comments &lt;/strong&gt;: if any&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;semitone &lt;/strong&gt;: midi note&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;pitch_reference &lt;/strong&gt;: the reference pitch&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Takes&lt;/p&gt;

&lt;p&gt;A sound can have several takes as some of them were recorded using different microphones at the same time. Each take has an associated audio file.&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;id&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;microphone&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;filename &lt;/strong&gt;: the name of the associated audio file&lt;/li&gt;
	&lt;li&gt;original_filename :&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;freesound_id &lt;/strong&gt;: for some sounds uploaded to freesound.org&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;sound_id &lt;/strong&gt;: the id of the sound in the DB&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;goodsound_id &lt;/strong&gt;: for some of the sounds available in good-sounds.org&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Packs&lt;/p&gt;

&lt;p&gt;A pack is a group of sounds from the same recording session. The audio files are organised in the *sound_files* directory in subfolders with the pack name to which they belong.&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;id&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;name&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;description&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Ratings&lt;/p&gt;

&lt;p&gt;Some musicians rated some sounds in a 0-10 goodness scale for the user evaluatio of the first project prototype. Please read the paper for more detailed information.&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;id&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;mark&lt;/strong&gt;: the rate or score.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;type&lt;/strong&gt;: the klass of the sound. Related to the tags of the sound.&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;created_at&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;comments&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;sound_id&lt;/strong&gt;&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;rater&lt;/strong&gt;: the musician who rated the sound.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;License:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This work is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.820936</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.820937</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
539
448
views
downloads
All versions This version
Views 539539
Downloads 448448
Data volume 6.2 TB6.2 TB
Unique views 498498
Unique downloads 301301

Share

Cite as