Published September 20, 2024 | Version v1
Dataset
Open
CLAP features for Audio Moment Retrieval
Creators
- 1. LY Corporation
Description
This page provides CLAP features for the three datasets used in "Language-based Audio Moment Retrieval" [1]:
- Clotho-Moment
- UnAV100-subset
- TUT Sound Events 2017
The raw WAV files are also publicly available here.
[1] H. Munakata, T. Nishimura, S. Nakada, T. Komatsu, "Language-based Audio Moment Retrieval", 2024, under review.
How to Use
These features can be used to train and evaluate audio moment retrieval models in Lighthouse.
See the Lighthouse instructions for setup details.
- Extract the archives with the following commands
Clotho-moment:
for file in clotho-moment_features.tar.part-*.gz; do gunzip "$file"; done
cat clotho-moment_features.tar.part-* > clotho-moment_features.tar
tar -xvf clotho-moment_features.tar
TUT Sound Events 2017, UnAV100-subset:
tar -xvf tut2017_features.tar.gz
tar -xvf unav100-subset_features.tar.gz
- Set symbolic links in Lighthouse
ln -s "$(pwd)"/features/{dataset_name} {lighthouse_dir}/features
- Train the model
python training/train.py --model qd_detr --dataset clotho-moment --feature clap
- Evaluate the model
model=qd_detr
dataset=unav100-subset
feature=clap
model_path={lighthouse_dir}/results/qd_detr/clotho-moment/clap/best.ckpt
eval_split_name=val
eval_path=data/unav100-subset/unav100-subset_test_release.jsonl
python training/evaluate.py \
--model $model \
--dataset $dataset \
--feature $feature \
--model_path $model_path \
--eval_split_name $eval_split_name \
--eval_path $eval_path
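The Clotho-Moment reassembly step above can also be sketched as a small shell function. This is only an illustrative alternative, not part of Lighthouse or of this dataset: the function name `reassemble_parts` is hypothetical, and it streams each part with `gunzip -c` instead of decompressing in place, so the downloaded `.gz` parts are left untouched. Like the `cat` command above, it assumes the part suffixes sort correctly under the shell's glob ordering.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of Lighthouse): decompress gzip'd parts
# and join them, in glob order, into a single tar archive.
# Usage: reassemble_parts <part-prefix> <output.tar>
reassemble_parts() {
    local prefix="$1" out="$2" part
    # Expand the glob once; if nothing matches, $1 stays a literal pattern.
    set -- "$prefix"*.gz
    if [ ! -e "$1" ]; then
        echo "no files match ${prefix}*.gz" >&2
        return 1
    fi
    : > "$out"                                   # start from an empty output file
    for part in "$@"; do
        gunzip -c "$part" >> "$out" || return 1  # -c keeps the original .gz
    done
}
```

Assuming the part names shown above, it could be called as `reassemble_parts clotho-moment_features.tar.part- clotho-moment_features.tar`, followed by the same `tar -xvf` command.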
Files
(10.6 GB)
MD5 | Size |
---|---|
025e3ba3d28c010ae4bb39ec5258fd97 | 985.6 MB |
fc9070b99c9893972a6754e14a507493 | 987.2 MB |
02fe827f5478e90e7eba077051ded10f | 990.9 MB |
54e382b6e4b668613f1be69d10c9128a | 990.9 MB |
c11635e6ded5f00e9509d3933fdfefec | 990.9 MB |
559fc7bfb138bdc3aabb4eb7c4111370 | 990.9 MB |
59cab1804ff028ae043721ac7d8da310 | 990.8 MB |
683a1c5d6e9e8f2bb5e58a7d38757c80 | 990.9 MB |
5a818c47f4637d234ddb281d8ff4f17a | 990.9 MB |
823011fcbe7278dffdd3fc8e01aef4da | 990.9 MB |
136b4420e26e4606e25924b0c23e8387 | 684.3 MB |
4d9fe3abef550c5f16495080fca172cf | 7.7 MB |
96f0e63d3571c8145d768c723f949a9c | 13.3 MB |
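The MD5 checksums listed above can be used to confirm that a download completed without corruption before extracting it. The following is a minimal sketch, assuming GNU `md5sum` is available (on macOS, `md5 -q` would be the rough equivalent); the function name `verify_md5` is hypothetical, and which checksum belongs to which file has to be taken from the download page.

```shell
#!/usr/bin/env bash
# Hypothetical helper: compare a file's MD5 digest with an expected value.
# Usage: verify_md5 <file> <expected-md5>
verify_md5() {
    local file="$1" expected="$2" actual
    # md5sum prints "<digest>  <filename>"; keep only the digest.
    actual="$(md5sum "$file" | awk '{print $1}')"
    if [ "$actual" = "$expected" ]; then
        echo "OK: $file"
    else
        echo "MISMATCH: $file (got '$actual', expected '$expected')" >&2
        return 1
    fi
}
```

A call would take the form `verify_md5 <downloaded file> <md5 from the listing>`; a non-zero exit status indicates that the file should be downloaded again.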
Additional details
Related works
- Is derived from
- Dataset: 10.5281/zenodo.3490684 (DOI)