An Objective Evaluation Methodology for Document Image Binarization Techniques
- 1 September 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 217-224
- https://doi.org/10.1109/das.2008.41
Abstract
Evaluation of document image binarization techniques is a tedious task that is mainly performed by a human expert or by involving an OCR engine. This paper presents an objective evaluation methodology for document image binarization techniques that aims to reduce the human involvement in the ground truth construction and subsequent testing. A skeletonized ground truth image is produced by the user following a semi-automatic procedure. The estimated ground truth image can aid in evaluating the binarization result in terms of recall and precision, as well as in further analyzing the result by calculating broken and missing text, deformations and false alarms. A detailed description of the methodology is presented, along with a benchmarking of the six (6) most promising state-of-the-art binarization algorithms based on the proposed methodology.
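The abstract does not give the exact formulas, so the following is only a minimal sketch of how recall and precision might be computed against a skeletonized ground truth. It assumes that recall is the fraction of ground-truth skeleton pixels covered by the binarized foreground, and that precision is the fraction of binarized foreground pixels falling inside an estimated ground-truth text region. The function name `recall_precision` and the `region_gt` mask are hypothetical and are not taken from the paper.

```python
def recall_precision(binary, skeleton_gt, region_gt):
    """Score a binarization result against a skeletonized ground truth.

    All three arguments are equal-sized 2D lists of 0/1 values (1 = text).
    The definitions below are assumptions for illustration, not the
    paper's exact formulas.
    """
    skel_total = skel_hit = fg_total = fg_ok = 0
    for b_row, s_row, r_row in zip(binary, skeleton_gt, region_gt):
        for b, s, r in zip(b_row, s_row, r_row):
            if s:                 # ground-truth skeleton pixel
                skel_total += 1
                skel_hit += b     # counted if the binarization kept it
            if b:                 # foreground pixel in the result
                fg_total += 1
                fg_ok += r        # counted if it lies in the text region
    recall = skel_hit / skel_total if skel_total else 0.0
    precision = fg_ok / fg_total if fg_total else 0.0
    return recall, precision


# Toy 3x4 example: a two-pixel skeleton inside a two-column text region.
skeleton = [[0, 0, 0, 0],
            [0, 1, 1, 0],
            [0, 0, 0, 0]]
region = [[0, 1, 1, 0],
          [0, 1, 1, 0],
          [0, 1, 1, 0]]
binary = [[0, 0, 0, 1],   # one false alarm outside the text region
          [0, 1, 0, 0],   # one of the two skeleton pixels is missed
          [0, 1, 0, 0]]

recall, precision = recall_precision(binary, skeleton, region)
```

In this toy case one of the two skeleton pixels is recovered, and two of the three foreground pixels lie inside the text region. Scoring recall on the skeleton rather than on full strokes means that a result with slightly thinner or thicker strokes is not penalized, as long as the stroke centerline survives.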