Automatic Detection of Voice Impairments by Means of Short-Term Cepstral Parameters and Neural Network Based Detectors

30 January 2004

journal article
research article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Biomedical Engineering

Vol. 51 (2), 380-384
https://doi.org/10.1109/tbme.2003.820386

Abstract

It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders-including glottic cancer-under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.

Keywords

This publication has 20 references indexed in Scilit:

Adaptive noise energy estimation in pathological speech signals
IEEE Transactions on Biomedical Engineering, 2000
A Neural Network Based Approach to Objective Voice Quality Assessment
Published by Springer Science and Business Media LLC ,1999
A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals
Journal of Speech, Language, and Hearing Research, 1993
Vocal Tremor Analysis With the Vocal Demodulator
Journal of Speech, Language, and Hearing Research, 1992
Detection of laryngeal function using speech and electroglottographic data
IEEE Transactions on Biomedical Engineering, 1992
The self-organizing map
Proceedings of the IEEE, 1990
Short-Term Stability Measures for the Evaluation of Vocal Quality
Journal of Speech, Language, and Hearing Research, 1990
Normalized noise energy as an acoustic measure to evaluate pathologic voice
The Journal of the Acoustical Society of America, 1986
Harmonics-to-Noise Ratio and Psychophysical Measurement of the Degree of Hoarseness
Journal of Speech, Language, and Hearing Research, 1984
Voiced/Unvoiced/Mixed excitation classification of speech
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1982

Cited by 181 articles