Published January 2, 2023 | Version 0.0.2
Dataset
Open
LJ Speech - Aligned IPA transcriptions
Description
Files:
- grids.zip: contains TextGrids for all audio files, each with three tiers: words, phonemes, and transcription
  - words contains the aligned, normalized English words
  - phonemes contains IPA pronunciations transcribed using the CMU dictionary, which were then aligned with the Montreal Forced Aligner. The pronunciations were then mapped from ARPAbet to IPA and duration marks were applied (without punctuation)
  - transcription contains unaligned phonemes including punctuation and word boundary labels (SIL0)
- preview.png: preview of the first TextGrid opened in Praat
- words-vocabulary.txt: contains all words from tier words
- phonemes-vocabulary.txt: contains all phonemes from tier phonemes
- transcription-vocabulary.txt: contains all phonemes/punctuation from tier transcription
- phonemes-durations.pdf: contains the plotted phoneme duration distribution of tier phonemes
- phonemes-durations-simple.pdf: contains the plotted phoneme duration distribution of tier phonemes if all duration markers are ignored
- pronunciations.dict: contains the pronunciations for each word including punctuation and weights (occurrence)
- script.sh: contains the script to reproduce all results
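As a rough illustration of how the three tiers could be read without extra dependencies, the sketch below parses TextGrid text with regular expressions. It assumes the files use Praat's long text format and have already been decoded into a Python string (the encoding of the files is not stated here); `read_tiers` is a hypothetical helper name, not part of the dataset or of script.sh.

```python
import re

# One interval in Praat's long text format: xmin, xmax, then the label.
INTERVAL_RE = re.compile(
    r'xmin = (?P<xmin>[\d.eE+-]+)\s+'
    r'xmax = (?P<xmax>[\d.eE+-]+)\s+'
    r'text = "(?P<text>(?:[^"]|"")*)"'
)
TIER_NAME_RE = re.compile(r'name = "(?P<name>[^"]*)"')


def read_tiers(textgrid_text):
    """Return {tier name: [(start, end, label), ...]} (hypothetical helper)."""
    tiers = {}
    # Every tier starts with an `item [n]:` header; the chunk before the
    # first header is the file preamble and is skipped.
    for chunk in re.split(r'item \[\d+\]:', textgrid_text)[1:]:
        name_match = TIER_NAME_RE.search(chunk)
        if name_match is None:
            continue
        tiers[name_match["name"]] = [
            # Praat escapes a double quote inside a label by doubling it.
            (float(m["xmin"]), float(m["xmax"]), m["text"].replace('""', '"'))
            for m in INTERVAL_RE.finditer(chunk)
        ]
    return tiers
```

With a file from grids.zip loaded into a string, `read_tiers(text)["phonemes"]` would then give the aligned phoneme intervals, provided the format assumption holds. A dedicated TextGrid library is likely the more robust choice for anything beyond a quick inspection.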
Phoneme duration marker:
- ˘ -> [0, 20) percentile
- ˑ -> [80, 90) percentile
- ː -> [90, inf) percentile
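The mapping above can be sketched as a small lookup. Two things are assumptions here rather than facts from the dataset: that the unlisted [20, 80) range carries no mark, and that the percentile is a simple rank against a reference list of durations (whether the reference is per phoneme or global is not stated). All function names are hypothetical.

```python
from bisect import bisect_left

SHORT, HALF_LONG, LONG = "˘", "ˑ", "ː"


def percentile_rank(value, sorted_values):
    """Percentage (0 to 100) of reference values strictly below `value`."""
    return 100.0 * bisect_left(sorted_values, value) / len(sorted_values)


def duration_mark(rank):
    """Map a percentile rank to the duration mark listed above."""
    if rank < 20:
        return SHORT
    if rank < 80:
        return ""  # assumption: the unlisted [20, 80) range is unmarked
    if rank < 90:
        return HALF_LONG
    return LONG


def strip_duration_marks(phoneme):
    """Remove the three marks, e.g. to compare phonemes with markers ignored.

    Caveat: this also removes any mark that was part of the base symbol.
    """
    return "".join(ch for ch in phoneme if ch not in (SHORT, HALF_LONG, LONG))
```

For example, `duration_mark(percentile_rank(d, reference))` would label a single duration `d` against a sorted list `reference` of durations in the same unit.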
Silence marker:
- SIL0 -> no silence
- SIL1 -> [0, 33.33) percentile
- SIL2 -> [33.33, 66.66) percentile
- SIL3 -> [66.66, inf) percentile
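A minimal sketch of the silence labelling, assuming that a pause of zero length means "no silence" and that the percentile is a rank against a list of observed pause durations; both points, and the name `silence_label`, are assumptions for illustration only.

```python
def silence_label(duration, reference_silences):
    """Return SIL0..SIL3 for a pause of `duration` (hypothetical helper)."""
    if duration <= 0:
        return "SIL0"  # assumption: zero-length pause means no silence
    # Percentile rank: share of reference pauses strictly shorter than this one.
    below = sum(1 for d in reference_silences if d < duration)
    rank = 100.0 * below / len(reference_silences)
    if rank < 33.33:
        return "SIL1"
    if rank < 66.66:
        return "SIL2"
    return "SIL3"
```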
Files (37.8 MB)
Checksum | Size |
---|---|
md5:009e100ae3ad8523dc6b153a21bcba9a | 33.9 MB |
md5:32cf66e25111cafd8c2abf10f2f15db2 | 164.8 kB |
md5:1394471b64abb690a20da9edd57f95d1 | 571.7 kB |
md5:2d38fa06d5c919e44e394a4896ef2bc7 | 1.8 kB |
md5:43af2d4563c16e5afcd2cb6c54b237ca | 170.7 kB |
md5:43b3fb61602f69e812055143d39fb41c | 2.7 MB |
md5:db0435c1498850a89e117a662b75c477 | 31.8 kB |
md5:729ba8f5ce49336c5d3281605a65f62e | 1.8 kB |
md5:e8b3648b0f8afb2215460436e0108198 | 217.2 kB |
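A downloaded file can be checked against its listed md5 value with the Python standard library. The sketch below is one way to do that; `md5_of_file` and `matches_checksum` are hypothetical helper names, and the path passed in is whichever local file is being verified.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, listed):
    """Compare a file against a listed value, with or without the `md5:` prefix."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_checksum(local_path, "md5:009e100ae3ad8523dc6b153a21bcba9a")` returns `True` only if the local file is byte-identical to the published one.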
Additional details
References
- McAuliffe, M., Socolof, M., Mihuc, S., Wagner, M., & Sonderegger, M. (2017). Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi. Interspeech 2017, 498–502. https://doi.org/10.21437/Interspeech.2017-1386