Published October 31, 2022
| Version 2
Dataset
Open
Text to audio grounding (TAG) dataset: AudioGrounding
Authors/Creators
Description
AudioGrounding dataset, including audio files and timestamp annotations.
Changes in version 2: The train/validation/test sets are re-split. The validation and test annotations are refined.
----------------------------------------------------------
References
[1] Xuenan Xu, Heinrich Dinkel, Mengyue Wu and Kai Yu. "Text-to-audio grounding: Building correspondence between captions and sound events." In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 606-610.