Dataset Open Access

Multilingual bottle-neck feature learning from untranscribed speech for track 1 in zerospeech2017 (system 2 -- with VTLN)

Hongjie Chen Chen; Cheung-Chi Leung; Lei Xie; Bin Ma; Haizhou Li

We investigate the extraction of bottle-neck features (BNFs) for multiple languages without access to manual transcription. Multilingual BNFs are derived from a multi-task learning deep neural network which is trained with unsupervised phoneme-like labels. The unsupervised phoneme-like labels are obtained from language-dependent Dirichlet process Gaussian mixture models separately trained on untranscribed speech of multiple languages.

In this version, the input MFCC for DPGMM is processed with VTLN.

 

Files (8.0 GB)
Name Size
10_5281_zenodo_822737.tar.gz
md5:aec84d688de278e3ed8df99fc536db68
8.0 GB Download
31
6
views
downloads
All versions This version
Views 3131
Downloads 66
Data volume 48.2 GB48.2 GB
Unique views 3131
Unique downloads 66

Share

Cite as