Published June 22, 2017 | Version v1
Dataset Open

Composite Embedding Systems Based on DNN-HMM and Attention End-To-End for ZeroSpeech2017 track1 (1)

  • 1. Tokyo Institute of Technology
  • 2. Mitsubishi Electric Research Laboratories

Description

Deep neural networks (DNNs) were trained for posterior and bottleneck features using Japanese and other language speech data. We explore various DNN types, their combinations, and dimension reduction by principal component analysis (PCA).

This version (version 1) extracts DNN bottleneck features obtained from GMM based SAT features. The DNN and GMM were trained by speech data from the corpus of spontaneous Japanese (CSJ).

Files

Files (8.2 GB)

Name Size Download all
md5:790449b4ecd35f7bea1ee37e501eeaac
8.2 GB Download