A full English sentence database for off-line handwriting recognition
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
We present a new database for off-line handwriting recognition, together with a few preprocessing and text segmentation procedures. The database is based on the Lancaster-Oslo/Bergen (LOB) corpus. This corpus is a collection of texts that were used to generate forms, which were subsequently filled out by persons in their own handwriting. As of December 1998, the database includes 556 forms produced by approximately 250 different writers. The database consists of full English sentences. It could serve as a basis for a variety of handwriting recognition tasks. The main focus, however, is on recognition techniques that use linguistic knowledge beyond the lexicon level. This knowledge can be automatically derived from the corpus or it can be supplied from external sources.
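
As one possible illustration of linguistic knowledge beyond the lexicon level that can be derived automatically from a text corpus, the sketch below estimates word-bigram probabilities by maximum likelihood. The abstract does not say which language model the authors have in mind, so the bigram model, the function name `bigram_probabilities`, the whitespace tokenization, and the toy sentences are all assumptions made for illustration only.

```python
from collections import Counter


def bigram_probabilities(sentences):
    """Estimate P(second | first) by maximum likelihood from raw sentences.

    Hypothetical helper: tokenization is a plain lowercase whitespace split,
    which is an assumption and not a procedure described in the abstract.
    """
    predecessor_counts = Counter()
    bigram_counts = Counter()
    for sentence in sentences:
        # Sentence boundary markers let the model score first and last words.
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        # Count each token that has a successor, and each adjacent pair.
        predecessor_counts.update(tokens[:-1])
        bigram_counts.update(zip(tokens[:-1], tokens[1:]))
    return {
        (first, second): count / predecessor_counts[first]
        for (first, second), count in bigram_counts.items()
    }


# Toy sentences standing in for corpus text (not taken from the LOB corpus).
toy_sentences = [
    "the form was filled out by hand",
    "the form was scanned",
]
probabilities = bigram_probabilities(toy_sentences)
```

In a recognizer, scores of this kind could be used to rank competing word hypotheses for a handwritten sentence; an unsmoothed estimate like this one assigns no probability to unseen word pairs, so a practical system would likely add smoothing.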