Dataset Open Access

Freesound content analyzed with Audio Commons Audio Extractor V2

Font, Frederic; Bogdanov, Dmitry


Citation Style Language JSON Export

{
  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.2546812", 
  "language": "eng", 
  "title": "Freesound content analyzed with Audio Commons Audio Extractor V2", 
  "issued": {
    "date-parts": [
      [
        2019, 
        1, 
        22
      ]
    ]
  }, 
  "abstract": "<p>This dataset contains the outputs of running the second prototype of the&nbsp;<a href=\"https://github.com/AudioCommons/ac-audio-extractor\">Audio Commons Audio Extractor</a>&nbsp;(ACExtractorV2)&nbsp;over 292k clips of the <a href=\"https://freesound.org\">Freesound</a> collection. This version of the&nbsp;audio extractor is described in <a href=\"https://www.audiocommons.org/assets/files/AC-WP4-UPF-D4.7%20Second%20prototype%20tool%20for%20the%20automatic%20semantic%20description%20of%20music%20samples.pdf\">Deliverable D4.7</a> of the AudioCommons project, and includes several music properties such as&nbsp;<strong>pitch</strong>,&nbsp;<strong>key</strong>&nbsp;and&nbsp;<strong>tempo</strong>&nbsp;(along with their confidence measures) which can be applied to music samples and music loops. It also includes prototype versions of the <a href=\"https://github.com/AudioCommons/timbral_models\">timbral models</a> described in <a href=\"https://www.audiocommons.org/assets/files/AC-WP5-SURREY-D5.6%20Second%20prototype%20of%20timbral%20characterisation%20tool%20for%20semantically%20annotating%20non-musical%20content.pdf\">Deliverable 5.6</a>.</p>\n\n<p>The preset dataset is structured as a single JSON file with a dictionary in which keys correspond to Freesound sound IDs. For each sound, the full output of the Audio Commons Audio Extractor is provided as another dictionary with the following keys:&nbsp;booming, note_midi, note_confidence, brightness, log_attack_time, sharpness, tonality_confidence, single_event, tempo, roughness, dynamic_range, depth, tempo_confidence, loop, note_frequency, temporal_centroid, loudness, tonality, warmth, hardness, note_name. In some cases, keys might be missing if the audio extractor could not produce a valid output for a specific property and a given file. More information about these audio properties can be found in the aforementioned deliverables and software tools.</p>", 
  "author": [
    {
      "family": "Font, Frederic"
    }, 
    {
      "family": "Bogdanov, Dmitry"
    }
  ], 
  "version": "1.0", 
  "type": "dataset", 
  "id": "2546812"
}
158
23
views
downloads
All versions This version
Views 158158
Downloads 2323
Data volume 901.9 MB901.9 MB
Unique views 141141
Unique downloads 1818

Share

Cite as