Published December 12, 2023
| Version v1
Model
Open
conditionaudiogen checkpoint
Creators
Description
Inspired by the practice in the image field which evaluates the location of generated object using a object detection model, we employ a SED system to provide event-based measures (Eb) and clip-level macro F1 score (At) to assess the temporal order control capability. These metrics evaluate the presence of sound events in the generated audio, as well as the onsets and offsets using the first-place SED system in DCASE 2022 Task 4, PB-SED (https://github.com/fgnt/pb_sed/), on AudioCondition test set.
The checkpint of PB-SED that we used for evaluating is publicly available here.
Files
sed_checkpoint.zip
Files
(580.0 MB)
Name | Size | Download all |
---|---|---|
md5:d641ee40afecd73280baf5460334f19e
|
580.0 MB | Preview Download |