High-Performance OCR for Printed English and Fraktur Using LSTM Networks
- 1 August 2013
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 683-687
- https://doi.org/10.1109/icdar.2013.140
Abstract
Long Short-Term Memory (LSTM) networks have yielded excellent results on handwriting recognition. This paper describes an application of bidirectional LSTM networks to the problem of machine-printed Latin and Fraktur recognition. Latin and Fraktur recognition differs significantly from handwriting recognition in both the statistical properties of the data, as well as in the required, much higher levels of accuracy. Applications of LSTM networks to handwriting recognition use two-dimensional recurrent networks, since the exact position and baseline of handwritten characters is variable. In contrast, for printed OCR, we used a one-dimensional recurrent network combined with a novel algorithm for baseline and x-height normalization. A number of databases were used for training and testing, including the UW3 database, artificially generated and degraded Fraktur text and scanned pages from a book digitization project. The LSTM architecture achieved 0.6% character-level test-set error on English text. When the artificially degraded Fraktur data set is divided into training and test sets, the system achieves an error rate of 1.64%. On specific books printed in Fraktur (not part of the training set), the system achieves error rates of 0.15% (Fontane) and 1.47% (Ersch-Gruber). These recognition accuracies were found without using any language modelling or any other post-processing techniques.Keywords
This publication has 10 references indexed in Scilit:
- History of the Tesseract OCR engine: what worked and what didn'tPublished by SPIE-Intl Soc Optical Eng ,2013
- Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence, 2010
- A Novel Connectionist System for Unconstrained Handwriting RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence, 2008
- Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on PerturbationsNeural Computation, 2002
- Long Short-Term MemoryNeural Computation, 1997
- LeRec: A NN/HMM Hybrid for On-Line Handwriting RecognitionNeural Computation, 1995
- Document image decoding using Markov source modelsIEEE Transactions on Pattern Analysis and Machine Intelligence, 1994
- Learning long-term dependencies with gradient descent is difficultIEEE Transactions on Neural Networks, 1994
- Finding structure in timeCognitive Science, 1990
- A tutorial on hidden Markov models and selected applications in speech recognitionProceedings of the IEEE, 1989