Offline recognition of unconstrained handwritten texts using HMMs and statistical language models

19 April 2004

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence

Vol. 26 (6), 709-720
https://doi.org/10.1109/tpami.2004.14

Abstract

This paper presents a system for the offline recognition of large vocabulary unconstrained handwritten texts. The only assumption made about the data is that it is written in English. This allows the application of statistical language models in order to improve the performance of our system. Several experiments have been performed using both single and multiple writer data. Lexica of variable size (from 10,000 to 50,000 words) have been used. The use of language models is shown to improve the accuracy of the system (when the lexicon contains 50,000 words, the error rate is reduced by ~50 percent for single writer data and by ~25 percent for multiple writer data). Our approach is described in detail and compared with other methods presented in the literature to deal with the same problem. An experimental setup to correctly deal with unconstrained text recognition is proposed.

This publication has 27 references indexed in Scilit:

Conjoined location and recognition of street names within a postal address delivery line
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
A statistical approach for phrase location and recognition within a text line: an application to street name recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002
Machine learning in automated text categorization
ACM Computing Surveys, 2002
Use of adaptive segmentation in handwritten phrase recognition
Pattern Recognition, 2002
An architecture for handwritten text recognition systems
International Journal on Document Analysis and Recognition (IJDAR), 1999
Recognition of legal amounts on bank cheques
Pattern Analysis and Applications, 1998
Handwritten phrase recognition as applied to street name images
Pattern Recognition, 1998
Use of lexical and syntactic techniques in recognizing handwritten text
Published by Association for Computational Linguistics (ACL), 1994
Control structure for interpreting handwritten addresses
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1994
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
IEEE Transactions on Information Theory, 1967

Cited by 193 articles