There is a newer version of the record available.

Published June 3, 2024 | Version v2
Resource type: Model | Access: Open

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers

  • 1. Universitat Politècnica de València

Description

Official model checkpoints for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers". Checkpoints for our audio-only, video-only, and audio-visual models are available along with their corresponding model configuration files.

Source code for evaluating our models, fine-tuning them, and training new ones on your own dataset is available in our official GitHub repository.

Files

model_checkpoints.zip

Files (3.1 GB)

Name                   Size    MD5
model_checkpoints.zip  3.1 GB  a020f20eedfb39f2b524171127c8bae9
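
Because the archive is large (3.1 GB), it can be worth checking the download against the MD5 checksum listed above before unpacking it. The following is a minimal sketch, not part of the official repository: it assumes the archive was saved as model_checkpoints.zip in the current working directory, and the helper names md5_of_file and verify_download are hypothetical.

```python
import hashlib
from pathlib import Path

# Checksum published on this record for model_checkpoints.zip.
EXPECTED_MD5 = "a020f20eedfb39f2b524171127c8bae9"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in 1 MiB chunks
    so the 3.1 GB archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path; adjust to wherever the archive was downloaded.
    archive = Path("model_checkpoints.zip")
    if not archive.exists():
        print(f"{archive} not found; download it from this record first.")
    elif verify_download(archive):
        print("Checksum matches; the archive is intact.")
    else:
        print("Checksum mismatch; the download may be corrupted.")
```

A mismatch usually indicates an interrupted or corrupted transfer, in which case downloading the archive again is the simplest remedy.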

Additional details

Funding

Generalitat Valenciana
Grant CIACIF/2021/295
Ministerio de Ciencia, Innovación y Universidades
Grant PID2021-124719OB-I00