A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation
- 1 March 2012
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- p. 4857-4860
- https://doi.org/10.1109/icassp.2012.6289007
Abstract
In this paper we describe our work in constructing a language identification system for use in our simultaneous lecture translation system. We first built PPR and PPRLM baseline systems that produce score-fusing language cue feature vectors for language discrimination and utilize an SVM back-end classifier for the actual language identification. On our bi-lingual lecture tasks the PPRLM system clearly outperforms the PPR system in various segment length conditions, however at the cost of slower run-time. By using lexical information in the form of keyword spotting, and additional language models we show ways to improve the performance of both baseline systems. In order to combine the faster run-time of the PPR system with the better performance of the PPRLM system we finally built a hybrid of both approaches that clearly outperforms the PPR system while not adding any additional computing time. This hybrid system is therefore our choice for the use in the lecture translation system due to its faster run-time and good performance.Keywords
This publication has 9 references indexed in Scilit:
- LIBSVMACM Transactions on Intelligent Systems and Technology, 2011
- Quaero Speech-to-Text and Text Translation Evaluation SystemsPublished by Springer Science and Business Media LLC ,2011
- Language IdentificationPublished by Wiley ,2009
- Automatic language identification using support vector machines and phonetic N-gramPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- The Design of Backend Classifiers in PPRLM System for Language IdentificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Automatic Language IdentificationPublished by Elsevier BV ,2006
- A one-pass decoder based on polymorphic linguistic context assignmentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Comparison of four approaches to automatic language identification of telephone speechIEEE Transactions on Speech and Audio Processing, 1996
- Support-vector networksMachine Learning, 1995