An investigation of deep neural networks for noise robust speech recognition

Top Cited Papers

1 May 2013

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149,p. 7398-7402
https://doi.org/10.1109/icassp.2013.6639100

Abstract

Recently, a new acoustic model based on deep neural networks (DNN) has been introduced. While the DNN has generated significant improvements over GMM-based systems on several tasks, there has been no evaluation of the robustness of such systems to environmental distortion. In this paper, we investigate the noise robustness of DNN-based acoustic models and find that they can match state-of-the-art performance on the Aurora 4 task without any explicit noise compensation. This performance can be further improved by incorporating information about the environment into DNN training using a new method called noise-aware training. When combined with the recently proposed dropout training technique, a 7.5% relative improvement over the previously best published result on this task is achieved using only a single decoding pass and no additional decoding complexity compared to a standard DNN.

Keywords

This publication has 14 references indexed in Scilit:

Speaker and Noise Factorization for Robust Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing, 2012
Derivative kernels for noise robust ASR
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Acoustic Modeling Using Deep Belief Networks
IEEE Transactions on Audio, Speech, and Language Processing, 2011
Noise Adaptive Training for Robust Automatic Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing, 2010
Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2010
Discriminative adaptive training with VTS and JUD
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Noise reduction using connectionist models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Connectionist probability estimators in HMM speech recognition
IEEE Transactions on Speech and Audio Processing, 1994
Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985

Cited by 350 articles