Dataset Open Access
This dataset contains the outputs of running the second prototype of the Audio Commons Audio Extractor (ACExtractorV2) over 292k clips from the Freesound collection. This version of the audio extractor is described in Deliverable D4.7 of the AudioCommons project. It includes several music properties, such as pitch, key and tempo (along with their confidence measures), which can be applied to music samples and music loops. It also includes prototype versions of the timbral models described in Deliverable D5.6.
The present dataset is structured as a single JSON file containing a dictionary whose keys are Freesound sound IDs. For each sound, the full output of the Audio Commons Audio Extractor is provided as another dictionary with the following keys: booming, note_midi, note_confidence, brightness, log_attack_time, sharpness, tonality_confidence, single_event, tempo, roughness, dynamic_range, depth, tempo_confidence, loop, note_frequency, temporal_centroid, loudness, tonality, warmth, hardness, note_name. A key may be missing for a given sound if the audio extractor could not produce a valid output for that property on that file. More information about these audio properties can be found in the aforementioned deliverables and software tools.
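As a minimal sketch of how the structure described above might be consumed, the following Python snippet loads the JSON file and reads properties defensively, since any key can be absent for a given sound. The function names, the file path passed to `load_analysis`, and the `min_confidence` threshold are all hypothetical and not part of the dataset; the snippet also assumes that confidence values are numeric and that higher means more reliable, which should be checked against the deliverables.

```python
import json
from typing import Any, Dict, List, Tuple

# Property keys as listed in the dataset description.
EXPECTED_KEYS = [
    "booming", "note_midi", "note_confidence", "brightness",
    "log_attack_time", "sharpness", "tonality_confidence", "single_event",
    "tempo", "roughness", "dynamic_range", "depth", "tempo_confidence",
    "loop", "note_frequency", "temporal_centroid", "loudness", "tonality",
    "warmth", "hardness", "note_name",
]


def load_analysis(path: str) -> Dict[str, Dict[str, Any]]:
    """Load the dataset: a dict mapping Freesound sound ID -> property dict.

    `path` is whatever local filename the downloaded JSON file has
    (hypothetical here; the description does not name the file).
    """
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def missing_keys(props: Dict[str, Any]) -> List[str]:
    """Return the expected property keys that are absent for one sound."""
    return [key for key in EXPECTED_KEYS if key not in props]


def confident_tempos(
    data: Dict[str, Dict[str, Any]],
    min_confidence: float = 0.8,  # assumed threshold, not from the dataset
) -> List[Tuple[str, Any]]:
    """Collect (sound_id, tempo) pairs whose tempo_confidence is high enough.

    Sounds lacking either `tempo` or `tempo_confidence` are skipped,
    because the extractor may omit keys it could not compute.
    """
    result = []
    for sound_id, props in data.items():
        tempo = props.get("tempo")
        confidence = props.get("tempo_confidence")
        if tempo is None or confidence is None:
            continue
        if confidence >= min_confidence:
            result.append((sound_id, tempo))
    return result
```

Using `dict.get` rather than direct indexing avoids a `KeyError` on sounds for which a property was not computed. For a file of this size, loading everything with `json.load` keeps the whole dictionary in memory; a streaming JSON parser may be preferable on constrained machines.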