Dataset Open Access

Composite Embedding Systems Based on DNN-HMM and Attention End-To-End for ZeroSpeech2017 track1 (1)

Shibata Hayato; Kato Taku; Shinozaki Takahiro; Watanabe Shinji

Deep neural networks (DNNs) were trained to produce posterior and bottleneck features using speech data from Japanese and other languages. We explore various DNN types, their combinations, and dimensionality reduction by principal component analysis (PCA).
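
As a rough illustration of the PCA step mentioned above, the following sketch reduces a matrix of frame-level feature vectors to fewer dimensions using NumPy. It is not the authors' pipeline: the function name pca_reduce, the random stand-in data, and the 40-to-10 dimension choice are all assumptions made for the example.

```python
import numpy as np


def pca_reduce(features, n_components):
    """Project row-wise feature vectors onto their top principal components.

    Hypothetical helper for illustration; not taken from the dataset.
    """
    mean = features.mean(axis=0)
    centered = features - mean
    # SVD of the centered data: the rows of vt are the principal directions,
    # ordered by decreasing singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    return centered @ components.T, mean, components


# Stand-in data (assumption): 200 frames of 40-dimensional features.
rng = np.random.default_rng(0)
frames = rng.normal(size=(200, 40))
reduced, mean, components = pca_reduce(frames, 10)
print(reduced.shape)
```

To apply the same projection to new frames, subtract the stored mean and multiply by the transpose of the stored components.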

This version (version 1) extracts DNN bottleneck features from GMM-based speaker adaptive training (SAT) features. The DNN and GMM were trained on speech data from the Corpus of Spontaneous Japanese (CSJ).

Files (8.2 GB)
Name: 10_5281_zenodo_815089.tar.gz
Size: 8.2 GB
md5: 790449b4ecd35f7bea1ee37e501eeaac
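
The listed md5 value can be used to check that the archive downloaded intact. The sketch below uses only the Python standard library and reads the file in chunks so the 8.2 GB archive is never held in memory at once. The helper names md5_of_file and matches_checksum are hypothetical, and the archive is assumed to sit in the current working directory.

```python
import hashlib
import os


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Compare a file's MD5 digest with an expected hex string."""
    return md5_of_file(path) == expected_hex.strip().lower()


# File name and checksum as listed on this page.
ARCHIVE = "10_5281_zenodo_815089.tar.gz"  # assumed location: working directory
EXPECTED_MD5 = "790449b4ecd35f7bea1ee37e501eeaac"

if os.path.exists(ARCHIVE):
    ok = matches_checksum(ARCHIVE, EXPECTED_MD5)
    print("checksum OK" if ok else "checksum MISMATCH")
```

A mismatch usually means the download was truncated or corrupted, in which case the archive should be fetched again before unpacking.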
