Development of the CU-HTK 2004 Broadcast News Transcription Systems
- 11 October 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
- Vol. 1, I/861
- https://doi.org/10.1109/icassp.2005.1415250
Abstract
The paper describes our recent work on improving broadcast news transcription and presents details of the CU-HTK broadcast news English (BN-E) transcription system for the DARPA/NIST rich transcription 2004 speech-to-text (RT04) evaluation. A key focus has been building a system using an order of magnitude more acoustic training data than we have previously attempted. We have also investigated a range of techniques to improve both minimum phone error (MPE) training and the efficient creation of MPE-based narrow-band models. The paper describes two alternative system structures that run in under 10×RT and a further system that runs in less than 1×RT. This final system gives lower word error rates than our 2003 system that ran in 10×RT.
This publication has 4 references indexed in Scilit:
- Improving broadcast news transcription by lightly supervised discriminative training. Published by Institute of Electrical and Electronics Engineers (IEEE), 2004
- The LIMSI Broadcast News transcription system. Speech Communication, 2002
- Lightly supervised and unsupervised acoustic model training. Computer Speech & Language, 2002
- Minimum phone error and I-smoothing for improved discriminative training. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002