Dataset Open Access

Freesound content analyzed with Audio Commons Audio Extractor V2

Font, Frederic; Bogdanov, Dmitry


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <controlfield tag="005">20200124192354.0</controlfield>
  <controlfield tag="001">2546812</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Music Technology Group, Universitat Pompeu Fabra</subfield>
    <subfield code="a">Bogdanov, Dmitry</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">39212378</subfield>
    <subfield code="z">md5:e072b3472d569f3663fe6aeb6472433c</subfield>
    <subfield code="u">https://zenodo.org/record/2546812/files/Freesound_ACExtractorV2_292k.json.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-01-22</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:2546812</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Music Technology Group, Universitat Pompeu Fabra</subfield>
    <subfield code="0">(orcid)0000-0002-4360-3210</subfield>
    <subfield code="a">Font, Frederic</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Freesound content analyzed with Audio Commons Audio Extractor V2</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">688382</subfield>
    <subfield code="a">Audio Commons: An Ecosystem for Creative Reuse of Audio Content</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This dataset contains the outputs of running the second prototype of the&amp;nbsp;&lt;a href="https://github.com/AudioCommons/ac-audio-extractor"&gt;Audio Commons Audio Extractor&lt;/a&gt;&amp;nbsp;(ACExtractorV2)&amp;nbsp;over 292k clips of the &lt;a href="https://freesound.org"&gt;Freesound&lt;/a&gt; collection. This version of the&amp;nbsp;audio extractor is described in &lt;a href="https://www.audiocommons.org/assets/files/AC-WP4-UPF-D4.7%20Second%20prototype%20tool%20for%20the%20automatic%20semantic%20description%20of%20music%20samples.pdf"&gt;Deliverable D4.7&lt;/a&gt; of the AudioCommons project, and includes several music properties such as&amp;nbsp;&lt;strong&gt;pitch&lt;/strong&gt;,&amp;nbsp;&lt;strong&gt;key&lt;/strong&gt;&amp;nbsp;and&amp;nbsp;&lt;strong&gt;tempo&lt;/strong&gt;&amp;nbsp;(along with their confidence measures) which can be applied to music samples and music loops. It also includes prototype versions of the &lt;a href="https://github.com/AudioCommons/timbral_models"&gt;timbral models&lt;/a&gt; described in &lt;a href="https://www.audiocommons.org/assets/files/AC-WP5-SURREY-D5.6%20Second%20prototype%20of%20timbral%20characterisation%20tool%20for%20semantically%20annotating%20non-musical%20content.pdf"&gt;Deliverable 5.6&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The preset dataset is structured as a single JSON file with a dictionary in which keys correspond to Freesound sound IDs. For each sound, the full output of the Audio Commons Audio Extractor is provided as another dictionary with the following keys:&amp;nbsp;booming, note_midi, note_confidence, brightness, log_attack_time, sharpness, tonality_confidence, single_event, tempo, roughness, dynamic_range, depth, tempo_confidence, loop, note_frequency, temporal_centroid, loudness, tonality, warmth, hardness, note_name. In some cases, keys might be missing if the audio extractor could not produce a valid output for a specific property and a given file. More information about these audio properties can be found in the aforementioned deliverables and software tools.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.2546811</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.2546812</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
158
23
views
downloads
All versions This version
Views 158158
Downloads 2323
Data volume 901.9 MB901.9 MB
Unique views 141141
Unique downloads 1818

Share

Cite as