Published July 12, 2024 | Version v1
Conference paper Open

Multimodal, Multi-Class Bias Mitigation for Predicting Speaker Confidence

  • 1. Bielefeld University, Germany
  • 2. University of Alberta, Canada

Description

Projecting confidence during conversation or presentation is a critical skill. To effectively display confidence, speakers must employ a blend of verbal and non-verbal signals. A predictive model that leverages rich multimodal cues to measure a speaker¿½fs confidence must also mitigate biases that develop through data labelling practices, inherent imbalances in the demographic distribution, or biases introduced into the model during the training process. Fairly predicting the confidence of speakers across differing backgrounds enables more accurate and actionable feedback to a larger population of speakers. This paper introduces a set of approaches for bias mitigation for multimodal, multi-class confidence prediction of adult speakers in a work-like setting. We evaluate the extent to which bias mitigation techniques improve the performance of a multimodal confidence classifier with a dataset of 233 2-minute videos. Experimental results suggest that by bounding the loss across perceived races, genders, accents, and ages, multimodal models can significantly outperform unmitigated baselines. The implications, including automated feedback of speaker confidence, are discussed.

Files

2024.EDM-industry.63.pdf

Files (404.5 kB)

Name Size Download all
md5:580d20459f54a1e0038372a6599bb47d
404.5 kB Preview Download