High-Performance OCR for Printed English and Fraktur Using LSTM Networks

1 August 2013

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 683-687
https://doi.org/10.1109/icdar.2013.140

Abstract

Long Short-Term Memory (LSTM) networks have yielded excellent results on handwriting recognition. This paper describes an application of bidirectional LSTM networks to the problem of machine-printed Latin and Fraktur recognition. Latin and Fraktur recognition differs significantly from handwriting recognition in both the statistical properties of the data, as well as in the required, much higher levels of accuracy. Applications of LSTM networks to handwriting recognition use two-dimensional recurrent networks, since the exact position and baseline of handwritten characters is variable. In contrast, for printed OCR, we used a one-dimensional recurrent network combined with a novel algorithm for baseline and x-height normalization. A number of databases were used for training and testing, including the UW3 database, artificially generated and degraded Fraktur text and scanned pages from a book digitization project. The LSTM architecture achieved 0.6% character-level test-set error on English text. When the artificially degraded Fraktur data set is divided into training and test sets, the system achieves an error rate of 1.64%. On specific books printed in Fraktur (not part of the training set), the system achieves error rates of 0.15% (Fontane) and 1.47% (Ersch-Gruber). These recognition accuracies were found without using any language modelling or any other post-processing techniques.

Keywords

This publication has 10 references indexed in Scilit:

History of the Tesseract OCR engine: what worked and what didn't
Published by SPIE-Intl Soc Optical Eng ,2013
Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010
A Novel Connectionist System for Unconstrained Handwriting Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008
Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations
Neural Computation, 2002
Long Short-Term Memory
Neural Computation, 1997
LeRec: A NN/HMM Hybrid for On-Line Handwriting Recognition
Neural Computation, 1995
Document image decoding using Markov source models
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1994
Learning long-term dependencies with gradient descent is difficult
IEEE Transactions on Neural Networks, 1994
Finding structure in time
Cognitive Science, 1990
A tutorial on hidden Markov models and selected applications in speech recognition
Proceedings of the IEEE, 1989

Cited by 145 articles