An Open Dataset of Synthetic Speech

Yaroshchuk, Artem; Papastergiopoulos, Christoforos; Cuccovillo, Luca; Aichroth, Patrick; Konstantinos, Konstantinos; Tzovaras, Dimitrios

doi:10.5281/zenodo.10124946

Published December 4, 2023 | Version v1

Conference paper Open

An Open Dataset of Synthetic Speech

1. Fraunhofer Institute for Digital Media Technology
2. Centre for Research and Technology Hellas

This paper introduces a multilingual, multispeaker dataset composed of synthetic and natural speech, designed to foster research and benchmarking in synthetic speech detection. The dataset encompasses 18,993 audio utterances synthesized from text, alongside with their corresponding natural equivalents, representing approximately 17 hours of synthetic audio data. The dataset features synthetic speech generated by 156 voices spanning three languages, namely, English, German, and Spanish, with a balanced gender representation. It targets state-of-the-art synthesis methods, and has been released with a license allowing seamless extension and redistribution by the research community.

Notes

The final version of the paper published by IEEE is available online at https://doi.org/10.1109/WIFS58808.2023.10374863.

Files

IEEE_WIFS_2023___Dataset_of_Synthetic_Speech.pdf

Files (128.5 kB)

Name	Size	Download all
IEEE_WIFS_2023___Dataset_of_Synthetic_Speech.pdf md5:51b6550c813d5c5cbf71c8c2934b21b9	128.5 kB	Preview Download

Additional details

DOI: 10.1109/WIFS58808.2023.10374863

Describes: Dataset: 10.5281/zenodo.8370668 (DOI)

European Commission
AI4Media - A European Excellence Centre for Media, Society and Democracy 951911
European Commission
vera.ai - vera.ai: VERification Assisted by Artificial Intelligence 101070093

Accepted: 2023-09-15

859

Views

562

Downloads

Show more details

	All versions	This version
Views	859	859
Downloads	562	562
Data volume	86.6 MB	86.6 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

IEEE

Conference

IEEE International Workshop on Information Forensics and Security (WIFS) , Nuremberg, Germany, 4-7 December 2023

Languages

English

License: © IEEE 2023

Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Technical metadata

Created: November 14, 2023
Modified: July 10, 2024

IEEE_WIFS_2023___Dataset_of_Synthetic_Speech.pdf

Files (128.5 kB)

Identifiers

Related works

Funding

Dates

An Open Dataset of Synthetic Speech

Authors/Creators

Description

Notes

Files

IEEE_WIFS_2023___Dataset_of_Synthetic_Speech.pdf

Files (128.5 kB)

Additional details

Identifiers

Related works

Funding

Dates