Dataset Open Access

Multilingual bottle-neck feature learning from untranscribed speech for track 1 in zerospeech2017 (system 2 -- with VTLN)

Hongjie Chen Chen; Cheung-Chi Leung; Lei Xie; Bin Ma; Haizhou Li

We investigate the extraction of bottle-neck features (BNFs) for multiple languages without access to manual transcription. Multilingual BNFs are derived from a multi-task learning deep neural network which is trained with unsupervised phoneme-like labels. The unsupervised phoneme-like labels are obtained from language-dependent Dirichlet process Gaussian mixture models separately trained on untranscribed speech of multiple languages.

In this version, the input MFCC for DPGMM is processed with VTLN.

 

Files (8.0 GB)
Name Size
10_5281_zenodo_822737.tar.gz
md5:aec84d688de278e3ed8df99fc536db68
8.0 GB Download
44
6
views
downloads
All versions This version
Views 4444
Downloads 66
Data volume 48.2 GB48.2 GB
Unique views 4343
Unique downloads 66

Share

Cite as