Dataset Open Access

Freesound content analyzed with Audio Commons Audio Extractor V2

Font, Frederic; Bogdanov, Dmitry

This dataset contains the outputs of running the second prototype of the Audio Commons Audio Extractor (ACExtractorV2) over 292k clips of the Freesound collection. This version of the audio extractor is described in Deliverable D4.7 of the AudioCommons project, and includes several music properties such as pitchkey and tempo (along with their confidence measures) which can be applied to music samples and music loops. It also includes prototype versions of the timbral models described in Deliverable 5.6.

The preset dataset is structured as a single JSON file with a dictionary in which keys correspond to Freesound sound IDs. For each sound, the full output of the Audio Commons Audio Extractor is provided as another dictionary with the following keys: booming, note_midi, note_confidence, brightness, log_attack_time, sharpness, tonality_confidence, single_event, tempo, roughness, dynamic_range, depth, tempo_confidence, loop, note_frequency, temporal_centroid, loudness, tonality, warmth, hardness, note_name. In some cases, keys might be missing if the audio extractor could not produce a valid output for a specific property and a given file. More information about these audio properties can be found in the aforementioned deliverables and software tools.

Files (39.2 MB)
Name Size
Freesound_ACExtractorV2_292k.json.zip
md5:e072b3472d569f3663fe6aeb6472433c
39.2 MB Download
37
4
views
downloads
All versions This version
Views 3737
Downloads 44
Data volume 156.8 MB156.8 MB
Unique views 3232
Unique downloads 44

Share

Cite as