An Open Dataset of Synthetic Speech
Creators
Description
This paper introduces a multilingual, multispeaker dataset composed of synthetic and natural speech, designed to foster research and benchmarking in synthetic speech detection. The dataset comprises 18,993 audio utterances synthesized from text, along with their corresponding natural equivalents, representing approximately 17 hours of synthetic audio data. It features synthetic speech generated by 156 voices spanning three languages (English, German, and Spanish) with a balanced gender representation. It targets state-of-the-art synthesis methods and has been released under a license allowing seamless extension and redistribution by the research community.
Files
Name | Size | Checksum |
---|---|---|
IEEE_WIFS_2023___Dataset_of_Synthetic_Speech.pdf | 128.5 kB | md5:51b6550c813d5c5cbf71c8c2934b21b9 |
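The MD5 checksum listed for the PDF can be used to confirm that a downloaded copy is intact. The following is a minimal sketch, assuming the file was saved under its listed name in the current directory; the helper names `md5_of_file` and `matches_record` are hypothetical and not part of any tooling referenced by this record.

```python
import hashlib
from pathlib import Path

# Checksum as listed in the Files table of this record.
EXPECTED_MD5 = "51b6550c813d5c5cbf71c8c2934b21b9"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected value (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path: the file name as listed in the record.
    pdf = Path("IEEE_WIFS_2023___Dataset_of_Synthetic_Speech.pdf")
    if pdf.exists():
        print("checksum OK" if matches_record(pdf) else "checksum MISMATCH")
    else:
        print(f"{pdf} not found; download it first")
```

MD5 is adequate here for detecting accidental corruption during download; it is not a guarantee against deliberate tampering.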
Additional details
Related works
- Describes
- Dataset: 10.5281/zenodo.8370668 (DOI)
Dates
- Accepted: 2023-09-15