An Energy-Efficient Speech-Extraction Processor for Robust User Speech Recognition in Mobile Head-Mounted Display Systems
- 24 May 2016
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Circuits and Systems II: Express Briefs
- Vol. 64 (4), 457-461
- https://doi.org/10.1109/tcsii.2016.2571902
Abstract
An energy-efficient speech extraction (SE) processor is proposed for robust user speech recognition (SR) in head-mounted display (HMD) systems. User SE is essential for robust user SR in a noisy environment. For the low-latency SE, the FastSE algorithm is proposed to overcome the time-consuming constrained-independent-component-analysis-based user speech selection process, which results in < 2-ms SE latency. Moreover, a reinforced-FastSE scheme is proposed to achieve 97.2% accuracy with only 33-kB FastSE on-chip memory for the low-power HMD applications. Also, a reconfigurable matrix operation accelerator is implemented for the energy-efficient acceleration of the dominant matrix operation in SE. As a result, the proposed SE processor achieves 1.3× higher speed with 4.24× smaller memory compared to the state-of-the-art work, so SR in a noisy environment becomes possible for mobile HMD applications.
Funding Information
- Basic Science Research Program
- National Research Foundation of Korea
- Ministry of Science, ICT & Future Planning (NRF-2015R1A2A1A05001889)
This publication has 8 references indexed in Scilit:
- An 81.6 μW FastICA Processor for Epileptic Seizure Detection. IEEE Transactions on Biomedical Circuits and Systems, 2014
- Energy-Efficient FastICA Implementation for Biomedical Signal Separation. IEEE Transactions on Neural Networks, 2011
- Extracting a source of shorter source-to-microphone distance from convolutive mixtures. Electronics Letters, 2011
- Implementation of Pipelined FastICA on FPGA for Real-Time Blind Source Separation. IEEE Transactions on Neural Networks, 2008
- Minimal distortion principle for blind source separation. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Independent component analysis: algorithms and applications. Neural Networks, 2000
- The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1976