Published December 18, 2023 | Version v1
Conference paper Open

Speech-based Age and Gender Prediction with Transformers

  • 1. audEERING GmbH, Germany
  • 2. Chair EIHW, University of Augsburg, Germany
  • 3. GLAM, Imperial College London, UK

Description

We report on the curation of several publicly available datasets for age and gender prediction. Furthermore, we present experiments to predict age and gender with models based on a pre-trained wav2vec 2.0. Depending on the dataset, we achieve an MAE between 7.1 years and 10.8 years for age, and at least 91.1%ACC for gender (female, male, child). Compared to a modelling approach built on hand-crafted features, our proposed system shows an improvement of 9% UAR for age and 4% UAR for gender. To make our findings reproducible, we release the best performing model to the community as well as the sample lists of the data splits.

Files

Age_ITG.pdf

Files (226.5 kB)

Name Size Download all
md5:efa516d919310136d973a439c19cb13e
226.5 kB Preview Download

Additional details

Identifiers

Funding

European Commission
MARVEL – Multimodal Extreme Scale Data Analytics for Smart Cities Environments 957337
European Commission
EASIER – Intelligent Automatic Sign Language Translation 101016982