DCASE2024 Task6 Baseline - Automated Audio Captioning (ConvNeXt-Transformer)

Labbé, Etienne

doi:10.5281/zenodo.10849427

Published April 1, 2024 | Version v1

Model Open

DCASE2024 Task6 Baseline - Automated Audio Captioning (ConvNeXt-Transformer)

Labbé, Etienne (Contact person)¹

1. Institut de Recherche en Informatique de Toulouse

DCASE2024 Task6 Baseline: ConvNeXt-Transformer model for Automated Audio Captioning.

This model is trained on the Clotho dataset
Extracts features using ConvNeXt
System reaches 29.6% SPIDEr-FL score on Clotho-eval (also named development-testing in DCASE)

This model requires representation extracted using a ConvNeXt pretrained for audio classification, available here under the filename convnext_tiny_465mAP_BL_AC_70kit.pth.

Files

tokenizer.json

Files (148.3 MB)

Name	Size	Download all
epoch_192-step_001544-mode_min-val_loss_3.3758.ckpt md5:9514a8e6fa547bd01fb1badde81c6d10	148.2 MB	Download
tokenizer.json md5:ee3fef19f7d0891d820d84035483a900	101.4 kB	Preview Download

Additional details

Requires: Model: 10.5281/zenodo.8020843 (DOI)

Repository URL: https://github.com/Labbeti/dcase2024-task6-baseline
Programming language: Python

261

Views

911

Downloads

Show more details

	All versions	This version
Views	261	261
Downloads	911	911
Data volume	73.4 GB	73.4 GB

More info on how stats are collected....

DOI

Resource type

Model

Publisher

Zenodo

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 23, 2024
Modified: April 1, 2024

DCASE2024 Task6 Baseline - Automated Audio Captioning (ConvNeXt-Transformer)

Files

tokenizer.json

Files (148.3 MB)

Additional details

Related works

Software

DCASE2024 Task6 Baseline - Automated Audio Captioning (ConvNeXt-Transformer)

Creators

Description

Files

tokenizer.json

Files (148.3 MB)

Additional details

Related works

Software