Dataset Open Access

Freesound content analyzed with Audio Commons Audio Extractor V2

Font, Frederic; Bogdanov, Dmitry

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Font, Frederic</dc:creator>
  <dc:creator>Bogdanov, Dmitry</dc:creator>
  <dc:description>This dataset contains the outputs of running the second prototype of the Audio Commons Audio Extractor (ACExtractorV2) over 292k clips of the Freesound collection. This version of the audio extractor is described in Deliverable D4.7 of the AudioCommons project, and includes several music properties such as pitch, key and tempo (along with their confidence measures) which can be applied to music samples and music loops. It also includes prototype versions of the timbral models described in Deliverable D5.6.

The present dataset is structured as a single JSON file containing a dictionary whose keys are Freesound sound IDs. For each sound, the full output of the Audio Commons Audio Extractor is provided as another dictionary with the following keys: booming, note_midi, note_confidence, brightness, log_attack_time, sharpness, tonality_confidence, single_event, tempo, roughness, dynamic_range, depth, tempo_confidence, loop, note_frequency, temporal_centroid, loudness, tonality, warmth, hardness, note_name. Some keys may be missing when the audio extractor could not produce a valid output for a given property and file. More information about these audio properties can be found in the aforementioned deliverables and software tools.</dc:description>
  <dc:title>Freesound content analyzed with Audio Commons Audio Extractor V2</dc:title>
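The structure given in the description (one JSON file holding a dictionary keyed by Freesound sound ID, with a per-sound dictionary of properties that may lack some keys) could be read along the following lines. This is only a minimal sketch: the helper names load_analysis, get_property and missing_keys are hypothetical, the file path is left to the caller because the record does not state a file name here, and the only assumption about the data is that sound IDs appear as JSON object keys, which are always strings.

```python
import json

# Property keys listed in the dataset description.
# Any of them may be absent for a given sound.
EXPECTED_KEYS = [
    "booming", "note_midi", "note_confidence", "brightness",
    "log_attack_time", "sharpness", "tonality_confidence", "single_event",
    "tempo", "roughness", "dynamic_range", "depth", "tempo_confidence",
    "loop", "note_frequency", "temporal_centroid", "loudness", "tonality",
    "warmth", "hardness", "note_name",
]


def load_analysis(path):
    """Load the dataset: a dict mapping sound IDs to property dicts."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def get_property(analysis, sound_id, key, default=None):
    """Return one property of a sound, or `default` if the sound or key is missing."""
    # JSON object keys are strings, so the ID is normalised before lookup.
    return analysis.get(str(sound_id), {}).get(key, default)


def missing_keys(properties):
    """List the documented keys that are absent from one sound's output."""
    return [key for key in EXPECTED_KEYS if key not in properties]
```

Reading properties through get_property rather than by direct indexing keeps a consumer from failing on sounds for which the extractor produced no valid value.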


Cite as