Published June 6, 2025 | Version v2
Dataset Open

STOPA: A Dataset of Systematic VariaTion Of DeePfake Audio for Open-Set Source Tracing and Attribution

  • 1. ROR icon Brno University of Technology
  • 2. ROR icon University of Eastern Finland

Description

STOPA is a dataset for source tracing and attribution of deepfake audio, identifying which synthesis system generated a given utterance. It includes over 700,000 synthetic speech samples, generated using 13 distinct systems with controlled variation across 8 acoustic models and 6 vocoders.

The dataset follows ASVspoof2019 Logical Access protocols and uses speakers from the VCTK corpus. It supports open-world evaluation, where test utterances are compared against individual source hypotheses without assuming closed-set conditions. Rich metadata and pairwise trial protocols enable fine-grained attribution at the level of attack, acoustic model, or vocoder.

All audio is provided as 16-bit PCM WAV at 16 kHz. Metadata includes transcription, silence regions, WER, and system labels.

License: CC BY 4.0
Audio type: Synthetic only
Language: English (VCTK-based)

Files

README.txt

Files (44.9 GB)

Name Size Download all
md5:1ed06fb2e675e7d25654acf8a803a732
12.3 kB Preview Download
md5:14de06c76901e3336a426af7176e3f8d
12.9 GB Download
md5:073ffd7e9978327fc8bdb527ac707640
12.9 GB Download
md5:cad058cfacdba5b5ade6a6a6248a34bc
12.9 GB Download
md5:5dc41b6eb8783f79379ed178eae85929
6.3 GB Download

Additional details

Related works

Is described by
10.21437/Interspeech.2025-2065 (DOI)

Funding

Brno University of Technology
Reliable, Secure, and Intelligent Computer Systems FIT- S-23-8151
Research Council of Finland
SPEECHFAKES: Generalized Voice Anti-Spoofing and Voice Biometrics

Dates

Accepted
2025-05-19
Accepted to Interspeech 2025