The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility

Bentsen, Thomas; May, Tobias; Kressner, Abigail Anne; Dau, Torsten

doi:10.5281/zenodo.1202206

Published March 17, 2018 | Version v1

Dataset Open

The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility

1. Hearing Systems, Technical University of Denmark

Contains all the data:

Bentsen, T., T.May, A. A. Kresnner, and T. Dau. The benefit of combining
a deep neural network architecture with ideal ratio mask estimation
in computational speech segregation to improve speech intelligibility.
PLOS ONE., in review.

There are two folders:

WRSs: the Word Recognition Scores (WRSs) from the listener study. The matrix has dimensions 9 conditions x 20 subjects. Data is ordered corresponding to the following condition order:
'UP', 'GMM', 'GMM (3 subbands)', 'GMM (7 subbands)', 'GMM (11 subbands)', 'DNN (IBM)'; 'DNN (IBM, 40 ms)'; 'DNN (IRM)'; 'DNN (IRM, 40 ms)'
Masks:
- GMM-IBMs: IBMs and estimated IBMs for the models 'GMM', 'GMM (3 subbands)', 'GMM (7 subbands)', 'GMM (11 subbands)'
- DNN-IBMs: IBMs and estimated IBMs for the models 'DNN (IBM)'; 'DNN (IBM, 40 ms)'
- DNN-IRMs: IRMs and estimated IRMs for the models 'DNN (IRM)'; 'DNN (IRM, 40 ms)'

Files

Masks.zip

Files (149.3 MB)

Name	Size	Download all
Masks.zip md5:8a5dc7b4a884a4f7881e1aa7e568d463	149.3 MB	Preview Download
WRSs.zip md5:e73624b1e0e426e90bfffe3eb1f65d51	873 Bytes	Preview Download

	All versions	This version
Views	298	298
Downloads	69	69
Data volume	5.7 GB	5.7 GB

The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility

Authors/Creators

Description

Files

Masks.zip

Files (149.3 MB)