Nonspeech segment rejection based on prosodic information for robust speech recognition

10 December 2002

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Signal Processing Letters

Vol. 9 (11), 364-367
https://doi.org/10.1109/lsp.2002.804564

Abstract

A new scheme for nonspeech rejection is proposed by considering that most nonspeech segments do not have well-defined prosodic structures as speech segments do. Certain parameters characterizing the smoothness of the peak index series and of the peak amplitude series of the normalized autocorrelation function are used to make nonspeech segment rejection decisions. The receiver-operating-characteristics curve and recognition word-error-rate reduction measures show that our approach is more effective than garbage-model-based schemes when used in telephone speech recognition.

Keywords

This publication has 2 references indexed in Scilit:

Rejection techniques for digit recognition in telecommunication applications
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1993
An improved endpoint detector for isolated word recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1981

Cited by 11 articles