Masking Speech Contents by Random Splicing: is Emotional Expression Preserved?

Burkhardt, Felix; Derington, Anna; Kahlau, Matthias; Scherer, Klaus; Eyben, Florian; Schuller, Bjorn

doi:10.5281/zenodo.10664711

Published May 5, 2023 | Version v1

Conference paper | Open access

1. audEERING GmbH, Germany
2. University of Geneva, Switzerland
3. Chair EIHW, University of Augsburg, Germany
4. GLAM, Imperial College London, UK

We examine how random splicing affects the perception of emotional expression in speech signals. Random splicing cuts a recording into short audio snippets and reassembles them in random order, with the aim of obfuscating the speech content. A subset of German parliament recordings was randomly spliced, and both versions, the original and the scrambled one, were manually labeled on the arousal, valence, and dominance dimensions. Additionally, we ran a state-of-the-art transformer-based pre-trained emotion model on the data. The annotations and predictions of the emotional dimensions correlate highly enough between the two versions for us to be confident that machine learners can be trained on randomly spliced data.
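As a rough illustration of the idea described above, the following Python sketch cuts a signal into segments, shuffles them, and compares two sets of ratings. It is not the authors' implementation: the fixed segment length, the `segment_seconds` default, the function names `random_splice` and `pearson`, and the choice of Pearson correlation are all assumptions made for this example, and the paper may segment the audio and measure agreement differently.

```python
import numpy as np


def random_splice(signal, sample_rate, segment_seconds=0.5, seed=None):
    """Cut a 1-D signal into fixed-length segments and return them in random order.

    segment_seconds is a hypothetical default, not a value taken from the paper.
    """
    signal = np.asarray(signal)
    seg_len = max(1, int(round(segment_seconds * sample_rate)))
    # Split at every multiple of seg_len; the last segment may be shorter.
    segments = np.split(signal, np.arange(seg_len, len(signal), seg_len))
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(segments))
    return np.concatenate([segments[i] for i in order])


def pearson(x, y):
    """Pearson correlation between two equally long rating sequences."""
    return float(np.corrcoef(x, y)[0, 1])


if __name__ == "__main__":
    sample_rate = 16000  # assumed sampling rate for the demo
    t = np.arange(2 * sample_rate) / sample_rate
    tone = np.sin(2 * np.pi * 220.0 * t)  # synthetic stand-in for speech
    spliced = random_splice(tone, sample_rate, segment_seconds=0.25, seed=0)
    print(len(tone), len(spliced))  # same number of samples, different order
```

Shuffling whole segments keeps every sample of the recording, so short-term acoustic properties survive while the word order is lost. Ratings collected for the original and spliced versions of the same items could then be compared with `pearson(original_ratings, spliced_ratings)`.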

Files (320.1 kB)

Name	Size
Random_splicing_ICASSP-2.pdf (md5:841c5e97cb6af0710de759a0f2e74b27)	320.1 kB

Additional details

DOI: 10.1109/ICASSP49357.2023.10097094

Funding
European Commission: MARVEL – Multimodal Extreme Scale Data Analytics for Smart Cities Environments (grant 957337)
European Commission: ECoWeB – Assessing and Enhancing Emotional Competence for Well-Being (ECoWeB) in the Young: A principled, evidence-based, mobile-health approach to prevent mental disorders and promote mental well-being (grant 754657)

