Warped Magnitude and Phase-Based Features for Language Identification
- 2 August 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 201-204
- https://doi.org/10.1109/icassp.2006.1659992
Abstract
To date, systems for the identification of spoken languages have normally used magnitude-based parameterization methods such as the MFCC and PLP. This paper investigates the use of the recently proposed modified group delay function (MODGDF) coefficients in combination with traditional magnitude-based features in a Gaussian mixture model (GMM) based system. We also examine the application of feature warping to magnitude-based features and the MODGDF and find that it can offer a significant cumulative improvement. We find that the addition of a modified regression-based shifted delta cepstrum (SDC) further improves system performance beyond that obtained by a more standard SDC configuration. The combination of PLP, feature warping and the proposed regression-based SDC achieved an accuracy of 88.4% in tests on 10 languages in the OGI TS Corpus, which compares very favourably with alternative language identification systems reported in the literature.
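The abstract names feature warping without describing it. As a rough illustration only, the sketch below shows the general idea as it is commonly described: each coefficient stream is remapped, over a sliding window, so that its local distribution approximates a standard normal. The function name `warp_feature_stream`, the 301-frame default window, and the truncation of the window at utterance edges are assumptions made for this sketch, not details taken from the paper.

```python
from statistics import NormalDist


def warp_feature_stream(values, window=301):
    """Warp one coefficient stream toward a standard normal distribution.

    `values` holds one cepstral coefficient per frame. The window length
    and the edge handling are assumptions of this sketch.
    """
    std_normal = NormalDist()  # zero mean, unit variance
    half = window // 2
    warped = []
    for t, x in enumerate(values):
        # Take the frames around frame t, truncated at the utterance edges.
        lo = max(0, t - half)
        hi = min(len(values), t + half + 1)
        segment = values[lo:hi]
        # Ascending 1-based rank of the centre frame within its window.
        rank = 1 + sum(1 for v in segment if v < x)
        # Map the rank to a probability in (0, 1), then through the
        # inverse normal CDF to obtain the warped value.
        warped.append(std_normal.inv_cdf((rank - 0.5) / len(segment)))
    return warped
```

For example, `warp_feature_stream([3.0, 1.0, 2.0], window=7)` keeps the ordering of the three frames and maps the median frame to 0. In a real front end the same mapping would be applied independently to every coefficient of every frame before GMM training and scoring.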