Mask-based enhancement for very low quality speech
- 1 May 2014
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 7029-7033
- https://doi.org/10.1109/icassp.2014.6854963
Abstract
We propose a mask-based enhancer for very low quality speech that is able to preserve important cues in a noise-robust manner by identifying the time-frequency regions that contain significant speech energy. We use a classifier to estimate a time-frequency mask from an input feature set that provides information about the energy distribution of both voiced and unvoiced speech. We evaluate the enhancer on a range of noisy speech signals and demonstrate that it yields consistent improvements in an objective intelligibility measure.Keywords
This publication has 21 references indexed in Scilit:
- Role of mask pattern in intelligibility of ideal binary-masked noisy speechThe Journal of the Acoustical Society of America, 2009
- An algorithm that improves speech intelligibility in noise for normal-hearing listenersThe Journal of the Acoustical Society of America, 2009
- Speech perception of noise with binary gainsThe Journal of the Acoustical Society of America, 2008
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reductionThe Journal of the Acoustical Society of America, 2008
- Subjective comparison and evaluation of speech enhancement algorithmsSpeech Communication, 2007
- An international comparison of long-term average speech spectraThe Journal of the Acoustical Society of America, 1994
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimatorIEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
- Speech enhancement using a minimum-mean square error short-time spectral amplitude estimatorIEEE Transactions on Acoustics, Speech, and Signal Processing, 1984
- Suggested formulae for calculating auditory-filter bandwidths and excitation patternsThe Journal of the Acoustical Society of America, 1983
- Suppression of acoustic noise in speech using spectral subtractionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979