Epoch Extraction From Speech Signals
- 21 October 2008
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Audio, Speech, and Language Processing
- Vol. 16 (8), 1602-1613
- https://doi.org/10.1109/tasl.2008.2004526
Abstract
An epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the instant of glottal closure. Extraction of epochs from speech is a challenging task due to the time-varying characteristics of the source and the system. Most epoch extraction methods attempt to remove the characteristics of the vocal-tract system, in order to emphasize the excitation characteristics in the residual. The performance of such methods depends critically on our ability to model the system. In this paper, we propose a method for epoch extraction which does not depend critically on characteristics of the time-varying vocal-tract system. The method exploits the impulse-like nature of the excitation. The proposed zero resonance frequency filter output brings out the epoch locations with high accuracy and reliability. The performance of the method is demonstrated on the CMU-Arctic database, using epoch information from the electroglottograph as the reference. The proposed method performs significantly better than the other methods currently available for epoch extraction. Notably, epoch extraction by the proposed method appears to be robust against degradations such as white noise, babble, high-frequency channel, and vehicle noise.
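The abstract names a zero resonance frequency filter but does not spell out its steps. The following is a minimal sketch of one common way such a filter is described: difference the signal, pass it through two cascaded resonators with their poles at 0 Hz, subtract a local mean to remove the resulting polynomial trend, and take the positive-going zero crossings as epoch candidates. The function names, the window length of about 1.5 pitch periods, the three trend-removal passes, and the polarity convention (glottal closure modeled as a negative impulse) are assumptions of this sketch, not details taken from the abstract.

```python
import numpy as np


def zero_frequency_resonator(x):
    """Ideal resonator at 0 Hz: y[n] = x[n] + 2*y[n-1] - y[n-2].

    With zero initial conditions this double pole at z = 1 is
    equivalent to two successive cumulative sums.
    """
    return np.cumsum(np.cumsum(x))


def remove_trend(y, half_window):
    """Subtract the local mean over a centered window of 2*half_window + 1 samples."""
    length = 2 * half_window + 1
    kernel = np.ones(length) / length
    return y - np.convolve(y, kernel, mode="same")


def extract_epochs(speech, fs, mean_pitch_period_s=0.008, trend_passes=3):
    """Return (epoch_sample_indices, filtered_signal).

    mean_pitch_period_s and trend_passes are assumed tuning values.
    Samples within roughly trend_passes * half_window of either end
    are affected by the zero padding of the moving average.
    """
    s = np.asarray(speech, dtype=np.float64)
    # Differencing removes any constant offset in the recording.
    x = np.diff(s, prepend=s[0])
    # Cascade of two zero-frequency resonators.
    y = zero_frequency_resonator(zero_frequency_resonator(x))
    # Window of about 1.5 average pitch periods (assumed choice).
    half_window = int(round(0.75 * mean_pitch_period_s * fs))
    for _ in range(trend_passes):
        y = remove_trend(y, half_window)
    # Positive-going zero crossings: first sample at or above zero.
    epochs = np.flatnonzero((y[:-1] < 0) & (y[1:] >= 0)) + 1
    return epochs, y


if __name__ == "__main__":
    # Synthetic check: a train of negative impulses every 64 samples.
    fs, period = 8000, 64
    signal = np.zeros(4000)
    signal[np.arange(200, 4000, period)] = -1.0
    found, _ = extract_epochs(signal, fs, mean_pitch_period_s=period / fs)
    interior = found[(found > 500) & (found < 3500)]
    print(interior[:5], np.diff(interior)[:5])
```

Because the resonator output grows polynomially, double precision is advisable and long recordings may need to be processed in segments. On real speech the detected crossings depend on the recording polarity, so inverting the signal may be needed before comparing against a reference such as the electroglottograph.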