Acoustic interference cancellation for a voice-driven interface in smart TVs
- 4 April 2013
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Consumer Electronics
- Vol. 59 (1), 244-249
- https://doi.org/10.1109/TCE.2013.6490266
Abstract
A novel method is proposed to improve the voice recognition performance by suppressing acoustic interferences that add nonlinear distortion to a target recording signal when received by the recognition device. The proposed method is expected to provide the best performance in smart TV environments, where a remote control collects command speech by the internal microphone and performs automatic voice recognition, and the secondary microphone equipped in a TV set provides the reference signal for the background noise source. Due to the transmission channel, the original interference is corrupted nonlinearly, and the conventional speech enhancement techniques such as beamforming and blind signal separation are not applicable. The proposed method first equalizes the interference in the two microphones by maximizing the instantaneous correlation between the nonlinearly related target recording and reference signal, and suppresses the equalized interference. To obtain an optimal estimation of the equalization filter, a method for detecting instantaneous activity of interference is also proposed. The validity of the proposed method is proved by the improvement in automatic voice recognition performance in a simulated TV room where loud TV sounds or babbling speech interfere in a user's commanding speech.Keywords
This publication has 15 references indexed in Scilit:
- Multistage utterance verification for keyword recognition-based online spoken content retrievalIEEE Transactions on Consumer Electronics, 2012
- Saliency-directed color image segmentation using modified particle swarm optimizationSignal Processing, 2012
- Efficient spectrum estimation of noise using line spectral pairs for robust speech recognitionElectronics Letters, 2011
- A voice trigger system using keyword and speaker recognition for mobile devicesIEEE Transactions on Consumer Electronics, 2009
- Independent vector analysis using densities represented by chain-like overlapped cliques in graphical models for separation of convolutedly mixed signalsElectronics Letters, 2009
- Spectral enhancement based on global soft decisionIEEE Signal Processing Letters, 2000
- Convolutive blind separation of non-stationary sourcesIEEE Transactions on Speech and Audio Processing, 2000
- On-line EM Algorithm for the Normalized Gaussian NetworkNeural Computation, 2000
- Suppression of acoustic noise in speech using spectral subtractionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Adaptive noise cancelling: Principles and applicationsProceedings of the IEEE, 1975