Published October 11, 2024 | Version v6
Dataset Open

Ressources for End-to-End French Text-to-Speech Blizzard challenge

  • 1. GIPSA-Lab, CNRS & Grenoble-Alps Univ.

Description

Here are 289 chapters of 5 audiobooks from Librivox (51:12) read by Nadine Eckert-Boulet (NEB):

  1. Madame Bovary (MB) by Gustave Flaubert (FL) - 3 volumes, 35 chapters
    (original wavs; text)
  2. Les mystères de Paris (LMP) by Eugene Sue (ES) - 4 volumes, 83 chapters (original wavs1, wavs2, wavs3; text1, text2, text3)
  3. Les tribulations d'un chinois en Chine (TCC) by Jules Verne (JV) - 1 volume, 22 chapters (original wavs; text)
  4. La fille du pirate (LFDP) by Henri Émile Chevalier (EC) - 7 volumes, 121 chapters (original wavs, text)
  5. La vampire (VAMP) by Paul Féval (PF) - 1 volume, 28 chapters (original wavs, text)

and

2515 utterances (2:03) read by another female French speaker Aurélie Derbier (AD):

  1. 1608 utterances extracted from various books (DIVERS_BOOK_AD*)
  2. 907 transcripts of the sessions of the French parliament (DIVERS_PARL_01*)

We recently added three speakers from Librivox/Litteratureaudio:

  1. Ezwa (EZWA): L'épouvante by Maurice Level (original wavs; text) - 11 chapters - 4869 utterances> 03:16
  2. Pauline Latournerie (PL): Le pédagogue n'aime pas les enfants by Henri Roorda (original wavs; text) - 6 chapters - 1320 utterances> 01:17
  3. Jean-Luc Fischer (JLF): L’Affaire Charles Dexter Ward by Howard Phillips Lovecraft (original wavs; text) - 16 chapters - 1823 utterances> 02:37

Each .wav file (sampled at 22050Hz) corresponds to one entire chapter. The format of the filenames is:
{author's acronym}_{book's acronym}_{reader's acronym}_{volume's number}_{chapter's number}

The NEB_train.csv file gives text and phonetic alignments (essentially for MB and LMP) for utterances in 4 fields separated by '|':
{filename}|{start_ms}|{end_ms}|{text or phonetic content}. Most utterances are separated by at least a pause of 400ms. The intervals [start_ms:end_ms] comprise leading and trailing silences of 130ms (since wavs are entire chapters, these silences are "true" ambient silences). Same for AD_train.csv.

When phonetic alignment has been performed, 2 additional fields have been added: {aligned phones}|{durations in ms}. Each input character or phone has a corresponding aligned phone and a duration. Note that all aligned utterances start and end with an aligned phone of 130ms. The set of aligned phones comprises:

  • The set of input phones
  • The silence: '__'
  • The symbol '_' for silent characters, e.g. "chat" is aligned with 's^ _ a _'
  • 29 combined aligned phones ('a&i', 'a&j', 'b&q', 'd&q','d&z', 'd&z^', 'f&q', 'g&q', 'g&z', 'j&i', 'j&u', 'j&q', 'i&j', 'k&q', 'k&s', 'k&s&q', 'l&q', 'm&q', 'n&q', 'r&w', 'r&q', 's&q', 't&q', 't&s', 't&s^', 'w&a', 'z&q', 'p&q') that align to only one character, e.g. "expatrier" is aligned with 'e^ k&s p a t r i&j e _'

Text is in UTF8. '«»','¬', '~','""','()','[]' are respectively used for speaking quotes, turn switches, three dots, quoted expression, aside quotes, notes. Because of rare occurrences, 'ö' has been transcribed as 'oe'. Paragraphs (two consecutive carriage returns in the original text) are cued by a special character '§'. It usually ends an utterance but could be used within an utterance if its associated pause is too short.

When available, phonetic content is given per word in curly brackets '{}'. We use 39 phonetic symbols:

  • oral vowels: a (fa), e (fée), e^ (fait), x (feu), x^ (coeur), i (riz), y (fut), u (fou), o (faux), o^ (porc)
  • schwa: q (gage)
  • nasal vowels: a~ (rang), e~ (fin), x~ (un), o~ (rond)
  • semi-vowels: h (huit), w (ouate), j (hier)
  • consonants: p (pas), t (tas), k (cas), b (bas), d (dos), g (gars), f (faux), s (sot) , s^ (chat), v (vu), z (zut), z^ (jus), r (riz), l (la), m (ma), n (non), n~ (oignon), ng (camping)

 

Files

CDW_LC_JLF_01_0001.wav

Files (1.7 GB)

Name Size Download all
md5:dfe63a81c7fdca6c19ca4d42d4c778b1
28.6 MB Preview Download
md5:50f9a28426995452cf2aa7f7c15e4107
32.8 MB Preview Download
md5:d4c0259c36d4e6b7f4bafbd8c7259180
45.9 MB Preview Download
md5:c815f1e3c9eae8cf156115967a9f9489
29.3 MB Preview Download
md5:27e8763768bd418adfa487a1586c44c9
36.7 MB Preview Download
md5:9bb1f67d556573518561089bb30033e9
34.3 MB Preview Download
md5:b284bdff92caea2752cb4a4616db0395
35.1 MB Preview Download
md5:f50ad0b783545acb99b0fa796d3c3560
14.6 MB Preview Download
md5:4ffcc3d1704964b76e1f47104c33babe
33.9 MB Preview Download
md5:5ca435fc6e56315a2f091fa03dea5702
24.8 MB Preview Download
md5:abe23c2808719eaecd93741a30e6d903
23.3 MB Preview Download
md5:261fec839320c7b0e64b1db7f9352332
38.2 MB Preview Download
md5:0c1f2d4637f9733922faedbbebedc1a4
25.1 MB Preview Download
md5:0bdd1a3fd731415aef1511a4af3e3bbf
30.3 MB Preview Download
md5:b61c9d9e5fb8960a3b924118f832124e
29.4 MB Preview Download
md5:596f5c49474e6049eb31c309fad9eae4
32.7 MB Preview Download
md5:2c3792ec8687c13d352cbc7b06e79150
42.4 MB Preview Download
md5:20a152bf16f84217c29c31d6e4ed8342
50.5 MB Preview Download
md5:adc26c5e35f5de1a509301696def112d
427.2 kB Preview Download
md5:61afa9122c225124244a5c3819036e7d
237.2 kB Preview Download
md5:58f4264693dc5c29530d32aa1459da5a
23.5 MB Preview Download
md5:8efb506efae08c8f07e538f0a9c0f8e4
33.9 MB Preview Download
md5:123098fb41c7f9706bd1c6ab10df651f
26.0 MB Preview Download
md5:7104f53f5b5002f052ca3076093b7aca
32.5 MB Preview Download
md5:96535ef58b109615263abb35d7dc0b56
14.8 MB Preview Download
md5:9e1a1303805aa6c4d2903e019d5799da
104.3 MB Preview Download
md5:5038b60047e1d6dba8d7a7d05bc78a12
46.7 MB Preview Download
md5:1d3d28a320eeec69f6e02fc3fada32b2
18.3 MB Preview Download
md5:b64cc45c726cf26cf60e0483e7afa2bd
35.5 MB Preview Download
md5:fe0d37235add2e4299a15a2c581b2d41
16.1 MB Preview Download
md5:f9bc4d3ead253fd06d4fc6712af3e1ee
2.4 MB Preview Download
md5:843d4a4d974b7b43b518ec2349d1c50d
133.3 MB Preview Download
md5:b59c8aa9c3529911673f7951b3185da5
57.3 MB Preview Download
md5:3ff7ca0e28502f841a49f9a2617e9a6d
71.3 MB Preview Download
md5:5e879912d0208cf6bbd9617a5866499c
49.0 MB Preview Download
md5:63de9fae6c6fa9ef36cfdeb488cbfec4
54.6 MB Preview Download
md5:dd95b8dc03fc652e4b536c5b0b13f3de
71.8 MB Preview Download
md5:201ef972e0777bd23b333258b2d9c654
64.5 MB Preview Download
md5:229f25e05335f501aa7701ff836dc36f
67.3 MB Preview Download
md5:175ac13149489093ab98b47bd7ba8dd5
60.5 MB Preview Download
md5:b0ba53583e8ef187acf16095f627b04c
107.7 MB Preview Download
md5:a72aa4c1041dbca599d70d306a1fc4b9
136.9 kB Preview Download

Additional details

Funding

Agence Nationale de la Recherche
MIAI – MIAI @ Grenoble Alpes ANR-19-P3IA-0003