Mask-based enhancement for very low quality speech

1 May 2014

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 7029-7033
https://doi.org/10.1109/icassp.2014.6854963

Abstract

We propose a mask-based enhancer for very low quality speech that is able to preserve important cues in a noise-robust manner by identifying the time-frequency regions that contain significant speech energy. We use a classifier to estimate a time-frequency mask from an input feature set that provides information about the energy distribution of both voiced and unvoiced speech. We evaluate the enhancer on a range of noisy speech signals and demonstrate that it yields consistent improvements in an objective intelligibility measure.

Keywords

This publication has 21 references indexed in Scilit:

Role of mask pattern in intelligibility of ideal binary-masked noisy speech
The Journal of the Acoustical Society of America, 2009
An algorithm that improves speech intelligibility in noise for normal-hearing listeners
The Journal of the Acoustical Society of America, 2009
Speech perception of noise with binary gains
The Journal of the Acoustical Society of America, 2008
Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
The Journal of the Acoustical Society of America, 2008
Subjective comparison and evaluation of speech enhancement algorithms
Speech Communication, 2007
An international comparison of long-term average speech spectra
The Journal of the Acoustical Society of America, 1994
Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984
Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
The Journal of the Acoustical Society of America, 1983
Suppression of acoustic noise in speech using spectral subtraction
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979

Cited by 13 articles