coqui-ai/TTS: v0.0.13
Creators
- Eren Gölge1
- Alexander Korolev
- Edresson Casanova2
- Thomas Werkmeister
- Thorsten Müller
- Reuben Morais
- WeberJulian
- Branislav Gerazov3
- Kirian Guiller
- Thorben Hellweg4
- Neil Stoker
- Jörg Thalheim5
- Nicolas Müller
- Rishikesh (ऋषिकेश)6
- Adonis Pujols
- bgerazov
- mittimithai
- Markus Toman
- geneing
- Guy Elsmore-Paddock7
- Martin Weinelt
- QP Hou
- jyegerlehner
- Anand...
- Bajibabu Bollepalli8
- forcecore
- Tom Rochette
- Yves-Noel Weweler
- fatihkiralioglu
- 1. Coqui.ai
- 2. University of São Paulo (USP)
- 3. Faculty of Electrical Engineering and Information Technologies
- 4. Airbus Defence & Space
- 5. no
- 6. Open Source
- 7. Inveniem
- 8. Aalto University
Description
SpeakerManagerclass for handling multi-speaker model management and interfacingspeaker.jsonfile.- Enabling multi-speaker models with
ttsandtts-serverendpoints. (:crown: @kirianguiller ) - Allow choosing a different
noise scalefor GlowTTS at inference. - Glow-TTS updates to import SC-Glow Models.
- Fixing windows support (:crown: @WeberJulian )
- Refactoring :frog: TTS installation and allow selecting different scopes (
all, tf, notebooks)for installation depending on the specific needs.
- SC-GlowTTS multi-speaker English model from our work https://arxiv.org/abs/2104.05557 (:crown: @Edresson )
- HiFiGAN vocoder finetuned for the above model.
- Tacotron DDC Non-Binary English model using Accenture's Sam dataset.
- HiFiGAN vocoder trained for the models above.
:bulb: All the models below are available by tts or tts-server endpoints on CLI as explained here.
Models with ✨️ below are new with this release.
Language Dataset Model Name Model Type TTS version Download:sparkles: English (non-binary) | sam (acccenture) | Tacotron2-DDC | tts | :smile: v0.0.13 | :floppy_disk:
:sparkles: English (multi-speaker) | VCTK | SC-GlowTTS | tts | :smile: v0.0.13| :floppy_disk: |
English | LJSpeech | Tacotron-DDC | tts | v0.0.12| :floppy_disk:
German | Thorsten-DE | Tacotron-DCA | tts | v0.0.11 |:floppy_disk:
German | Thorsten-DE | Wavegrad | vocoder |v0.0.11 |:floppy_disk:
English | LJSpeech | SpeedySpeech | tts | v0.0.10 |:floppy_disk:
English | EK1 | Tacotron2 | tts |v0.0.10 |:floppy_disk:
Russian | RUSLAN | TacotronDDC | tts | v0.0.10 |:floppy_disk:
Dutch | MAI | TacotronDDC | tts | v0.0.10 |:floppy_disk:
Chinese | Baker | TacotronDDC-GST | tts | v0.0.10 |:floppy_disk:
English | LJSpeech | TacotronDCA | tts |v0.0.9 |:floppy_disk:
English | LJSpeech | Glow-TTS | tts |v0.0.9 |:floppy_disk:
Spanish | M-AILabs | TacotronDDC | tts |v0.0.9 |:floppy_disk:
French | M_AILabs | TacotronDDC | tts |v0.0.9 |:floppy_disk:
Dutch | MAI | TacotronDDC | tts | v0.0.10 |:floppy_disk:
:sparkles: English | sam (accenture) | HiFiGAN | vocoder | :smile: v0.0.13| :floppy_disk:
:sparkles: English | VCTK | HiFiGAN | vocoder | :smile: v0.0.13| :floppy_disk:
English | LJSpeech | HiFiGAN | vocoder | v0.0.12| :floppy_disk:
English | EK1 | WaveGrad | vocoder | v0.0.10 |:floppy_disk:
Dutch | MAI | ParallelWaveGAN | vocoder | v0.0.10 |:floppy_disk:
English | LJSpeech | MB-MelGAN | vocoder |v0.0.9 |:floppy_disk:
:earth_africa: Multi-Lang | LibriTTS | FullBand-MelGAN | vocoder |v0.0.9 |:floppy_disk:
:earth_africa: Multi-Lang | LibriTTS | WaveGrad | vocoder |v0.0.9 |:floppy_disk:
Files
coqui-ai/TTS-v0.0.13.zip
Files
(12.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:c55844606f1367af1030b5197f6b9bbf
|
12.1 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/coqui-ai/TTS/tree/v0.0.13 (URL)