Thorsten-Voice Dataset 2022.10

Müller, Thorsten; Kreutz, Dominik

doi:10.5281/zenodo.7265581

Published October 30, 2022 | Version 1.0

Dataset Open

1. Thorsten-Voice

The goal of the Thorsten-Voice project is to provide voice datasets and TTS models for a free, high-quality German artificial voice. This dataset, "Thorsten-Voice Dataset 2022.10", is a neutrally spoken voice dataset recorded by Thorsten Müller, with audio optimized by Dominik Kreutz. It is licensed under CC0 so that anybody can use it without financial or licensing hurdles.

"I contribute my personal voice as a person believing in a world where all people are equal. No matter of gender, sexual orientation, religion, skin color and geocoordinates of birth location. A global world where everybody is warmly welcome on any place on this planet and open and free knowledge and education is available to everyone." (Thorsten Müller)

Dataset details:

LJSpeech file and directory structure
12,450 recorded phrases (WAV files)
more than 11 hours of pure audio
sample rate: 22,050 Hz
mono
normalized to -24 dB
no silence at beginning or end
average spoken characters per second: 17.5
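
The properties listed above can be spot-checked after unpacking the archive. The following is a minimal sketch using only the Python standard library. It assumes the usual LJSpeech convention of a pipe-delimited metadata file with lines of the form `id|text` (optionally followed by a normalized-text column) and PCM WAV clips; the exact file names and column count inside this archive are not stated here, so treat them as assumptions. All function names are hypothetical helpers, not part of any Thorsten-Voice tooling.

```python
import wave
from pathlib import Path


def parse_metadata_line(line: str) -> tuple[str, str]:
    """Split one LJSpeech-style 'id|text[|normalized text]' line into (id, text).

    Assumption: fields are pipe-delimited, as in the LJSpeech convention.
    """
    parts = line.rstrip("\n").split("|")
    if len(parts) < 2:
        raise ValueError(f"expected at least 'id|text', got: {line!r}")
    return parts[0], parts[1]


def wav_info(path: Path) -> dict:
    """Read sample rate, channel count and duration (seconds) of a PCM WAV file."""
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        return {
            "rate": rate,
            "channels": wav.getnchannels(),
            "seconds": wav.getnframes() / rate,
        }


def matches_dataset_spec(info: dict) -> bool:
    """True if the clip is 22,050 Hz mono, as stated in the dataset details."""
    return info["rate"] == 22050 and info["channels"] == 1


def chars_per_second(text: str, seconds: float) -> float:
    """Spoken characters per second for a single clip."""
    return len(text) / seconds
```

Averaging `chars_per_second` over all clips should land near the stated 17.5, although the exact figure depends on how characters are counted (for example, whether punctuation and spaces are included), which the description does not specify.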

See more details on my GitHub page or the Thorsten-Voice project website.

Files

Name	MD5	Size
ThorstenVoice-Dataset_2022.10.zip	c2c2cb0d8a2b3b240e140d9213cd39b8	1.4 GB
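
Because the archive is large, it can be worth comparing a download against the MD5 checksum in the file listing above before unpacking. A minimal sketch with Python's standard `hashlib` follows; the function names and the 1 MiB chunk size are illustrative choices, not something the dataset prescribes.

```python
import hashlib
from pathlib import Path

# Checksum published in the file listing for ThorstenVoice-Dataset_2022.10.zip.
EXPECTED_MD5 = "c2c2cb0d8a2b3b240e140d9213cd39b8"


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks
    so the 1.4 GB archive never has to fit in memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path: Path, expected: str = EXPECTED_MD5) -> bool:
    """True if the file's MD5 digest matches the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

For example, `verify_archive(Path("ThorstenVoice-Dataset_2022.10.zip"))` should return `True` for an intact download, assuming the file was saved under that name in the current directory.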

Additional details

Is supplement to: Dataset: 10.5281/zenodo.5525342 (DOI); Dataset: 10.5281/zenodo.5525023 (DOI)

	All versions	This version
Views	5,113	5,069
Downloads	7,725	7,683
Data volume	16.3 TB	16.3 TB
