Published September 20, 2024 | Version v1
Dataset Open

CLAP features for Audio Moment Retrieval

Description

This page includes CLAP features of three datasets used in Language-based audio moment retrieval [1].

  • Clotho-Moment
  • UnAV100-subset
  • TUT Sound Events 2017

Raw wav files are also publicly available here.

[1] H. Munakata, T. Nishimura, S. Nakada, T. Komatsu, "Language-based Audio Moment Retrieval", 2024, under review.

How to Use

We can train/evaluate audio moment retrieval models using these features in Lighthouse.
Please check the instructions of Lighthouse.

  1. Unzip the file with the following commands
    Clotho-moment: 
    for file in clotho-moment_features.tar.part-*.gz; do gunzip "$file"; done
    clotho-moment_features.tar.part-* > clotho-moment_features.tar
    tar -xvf clotho-moment_features.tar

    UnAV100-subset, TUT Sound Events 2017: 
    tar -xvf tut2017_features.tar.gz
    tar -xvf unav100-subset_features.tar.gz
  2. Set symbolic links in Lighthouse
    ln -s features/{dataset_name} {lighthouse_dir}/features
  3. Train the model
    python training/train.py --model qd_detr --dataset clotho-moment --feature clap

  4. Evaluate the model
    model=qd_detr
    dataset=unav100-subset
    feature=clap
    model_path={lighthouse_dir}/results/qd_detr/clotho-moment/clap/best.ckpt
    eval_split_name=val
    eval_path=data/unav100-subset/unav100-subset_test_release.jsonl

    python training/evaluate.py \
    --model $model \
    --dataset $dataset \
    --feature $feature \
    --model_path $model_path \
    --eval_split_name $eval_split_name \
    --eval_path $eval_path




Files

Files (10.6 GB)

Name Size Download all
md5:025e3ba3d28c010ae4bb39ec5258fd97
985.6 MB Download
md5:fc9070b99c9893972a6754e14a507493
987.2 MB Download
md5:02fe827f5478e90e7eba077051ded10f
990.9 MB Download
md5:54e382b6e4b668613f1be69d10c9128a
990.9 MB Download
md5:c11635e6ded5f00e9509d3933fdfefec
990.9 MB Download
md5:559fc7bfb138bdc3aabb4eb7c4111370
990.9 MB Download
md5:59cab1804ff028ae043721ac7d8da310
990.8 MB Download
md5:683a1c5d6e9e8f2bb5e58a7d38757c80
990.9 MB Download
md5:5a818c47f4637d234ddb281d8ff4f17a
990.9 MB Download
md5:823011fcbe7278dffdd3fc8e01aef4da
990.9 MB Download
md5:136b4420e26e4606e25924b0c23e8387
684.3 MB Download
md5:4d9fe3abef550c5f16495080fca172cf
7.7 MB Download
md5:96f0e63d3571c8145d768c723f949a9c
13.3 MB Download

Additional details

Related works

Is derived from
Dataset: 10.5281/zenodo.3490684 (DOI)