Published December 12, 2023 | Version v1
Model Open

conditionaudiogen checkpoint

Creators

Description

Inspired by the practice in the image domain of evaluating the location of generated objects with an object detection model, we employ a sound event detection (SED) system to provide event-based measures (Eb) and the clip-level macro F1 score (At) to assess temporal order control capability. These metrics evaluate the presence of sound events in the generated audio, as well as their onsets and offsets, using the first-place SED system in DCASE 2022 Task 4, PB-SED (https://github.com/fgnt/pb_sed/), on the AudioCondition test set.
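
To make the two metric families concrete, the following is a minimal pure-Python sketch, not the PB-SED or official DCASE evaluation code. It assumes that SED predictions have already been converted into label sets per clip (for At) and into (label, onset, offset) tuples in seconds (for Eb). The function names, the 200 ms collar, and the 20% offset tolerance are illustrative assumptions and are not stated in this record.

```python
from collections import defaultdict


def clip_macro_f1(reference, predicted):
    """Clip-level macro F1: per-class F1 over clip tags, averaged over classes.

    reference / predicted: dict mapping clip_id -> set of event labels.
    """
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0})
    for clip_id in reference.keys() | predicted.keys():
        ref = reference.get(clip_id, set())
        pred = predicted.get(clip_id, set())
        for label in ref & pred:
            counts[label]["tp"] += 1
        for label in pred - ref:
            counts[label]["fp"] += 1
        for label in ref - pred:
            counts[label]["fn"] += 1
    f1_scores = [
        2 * c["tp"] / (2 * c["tp"] + c["fp"] + c["fn"]) for c in counts.values()
    ]
    return sum(f1_scores) / len(f1_scores) if f1_scores else 0.0


def event_based_f1(reference, estimated, collar=0.2, offset_ratio=0.2):
    """Simplified event-based F1 for one clip, using greedy matching.

    reference / estimated: lists of (label, onset, offset) in seconds.
    An estimated event counts as correct when its label matches an unmatched
    reference event, its onset is within `collar` seconds, and its offset is
    within max(collar, offset_ratio * reference duration). The default values
    are assumptions made for this sketch.
    """
    matched = set()
    true_positives = 0
    for label, onset, offset in estimated:
        for index, (ref_label, ref_onset, ref_offset) in enumerate(reference):
            if index in matched or ref_label != label:
                continue
            offset_tolerance = max(collar, offset_ratio * (ref_offset - ref_onset))
            onset_ok = abs(onset - ref_onset) <= collar
            offset_ok = abs(offset - ref_offset) <= offset_tolerance
            if onset_ok and offset_ok:
                matched.add(index)
                true_positives += 1
                break
    false_positives = len(estimated) - true_positives
    false_negatives = len(reference) - true_positives
    denominator = 2 * true_positives + false_positives + false_negatives
    return 2 * true_positives / denominator if denominator else 0.0
```

In this sketch, a generated clip scores well on the event-based measure only when the requested events appear at roughly the requested times, whereas the clip-level score checks only whether each event is present.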

The PB-SED checkpoint that we used for evaluation is publicly available here.

Files

Files (580.0 MB)

sed_checkpoint.zip (580.0 MB)
md5:d641ee40afecd73280baf5460334f19e
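
After downloading, the archive can be checked against the MD5 checksum listed above. The snippet below is a small sketch using Python's standard hashlib module. The helper names and the local file path are assumptions for illustration.

```python
import hashlib

# Checksum listed for sed_checkpoint.zip in this record.
EXPECTED_MD5 = "d641ee40afecd73280baf5460334f19e"


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_MD5):
    """Return True when the file at `path` matches the expected checksum."""
    return file_md5(path) == expected
```

For example, `verify("sed_checkpoint.zip")` would return True for an intact download, assuming the archive was saved under that name in the current directory.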