Dataset Open Access

Composite Embedding Systems Based on DNN-HMM and Attention End-To-End for ZeroSpeech2017 track1 (2)

Shibata Hayato; Kato Taku; Shinozaki Takahiro; Watanabe Shinji

Deep neural networks (DNNs) were trained for posterior and bottleneck features using Japanese and other language speech data. We explore various DNN types, their combinations, and dimension reduction by principal component analysis (PCA).

This version (version 2 ) concatenates  CSJ feature vector and PCA compressed feature vector made from attention end-to-end feature.

X:CSJ feature (60 dim bottleneck, (version 1 feature))

S:Attention end-to-end feature (320 dim)

T:PCA(S) (60 dim)


Files (20.2 GB)
Name Size
20.2 GB Download


Cite as