Calibrating neural networks for synthetic speech detection: A likelihood-ratio-based approach

Cuccovillo, Luca; Aichroth, Patrick; Köllmer, Thomas

doi:10.5281/zenodo.13366772

Published June 17, 2024 | Version v1

Conference paper Open

Calibrating neural networks for synthetic speech detection: A likelihood-ratio-based approach

1. Fraunhofer Institute for Digital Media Technology

In this paper, we introduce a calibration procedure designed to convert the uncalibrated output scores of neural networks for synthetic speech detection into calibrated and interpretable likelihood ratios. This procedure is based on the assumption that the networks subject to calibration are deterministic and have undergone training until they reached convergence. Provided these conditions are satisfied, it is then possible to transform their output values into likelihood ratios using a minimal set of validation and calibration data, eliminating the need for retraining the models. We successfully tested the entire workflow on a state-of-the-art network example, demonstrating not only its effectiveness in calibration but also its ability to enhance fault tolerance against inadequate inputs.

Files

AES_2024__LLR-Based-Calibration-for-Synthesis-Detection.pdf

Files (473.6 kB)

Name	Size	Download all
AES_2024__LLR-Based-Calibration-for-Synthesis-Detection.pdf md5:26772c92c5369a07500c2c5c64503a72	473.6 kB	Preview Download

Additional details

URL: https://aes2.org/publications/elibrary-page/?id=22628

Views

Downloads

Show more details

	All versions	This version
Views	36	36
Downloads	37	37
Data volume	18.0 MB	18.0 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

AES

Conference

AES International Conference on Audio Forensics , Denver, CO, USA

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: August 23, 2024
Modified: August 23, 2024

Calibrating neural networks for synthetic speech detection: A likelihood-ratio-based approach

Authors/Creators

Description

Files

AES_2024__LLR-Based-Calibration-for-Synthesis-Detection.pdf

Files (473.6 kB)

Additional details

Identifiers