A two-stage binarization approach for document images
- 1 January 2001
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489)
Abstract
Binarization of a gray-scale document image is one of the most important steps in automatic document processing. This paper presents a two-stage document image binarization approach. The approach first applies a region-based binarization technique to the whole image, then applies a neural-network-based binarization technique to those text blocks in which good character segmentation could not be achieved in the first stage. Experimental results on a number of document images show that the two-stage approach performs better than other binarization techniques in terms of character segmentation quality and computing time.
- Department of Electronic and Information Engineering
- Refereed conference paper
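The two-stage control flow described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the region-based technique, the segmentation-quality criterion, or the neural network, so everything concrete here is an assumption: Otsu's histogram threshold stands in for the first stage, fixed square tiles stand in for text blocks, and `is_poor` and `refine` are hypothetical caller-supplied hooks for the quality check and the second-stage binarizer.

```python
from typing import Callable, List

Image = List[List[int]]  # rows of 8-bit gray levels (0 = black, 255 = white)


def otsu_threshold(pixels: List[int]) -> int:
    """Return the gray level t that maximizes between-class variance."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with level <= t
    sum0 = 0  # sum of gray levels of those pixels
    for t in range(256):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def two_stage_binarize(
    image: Image,
    block: int,
    is_poor: Callable[[Image], bool],   # hypothetical quality check
    refine: Callable[[Image], Image],   # hypothetical stage-2 binarizer
) -> Image:
    """Stage 1: threshold every tile. Stage 2: redo only the poor tiles."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for top in range(0, h, block):
        for left in range(0, w, block):
            tile = [row[left:left + block] for row in image[top:top + block]]
            t = otsu_threshold([p for row in tile for p in row])
            # 1 marks text (dark) pixels, 0 marks background.
            binary = [[1 if p <= t else 0 for p in row] for row in tile]
            if is_poor(binary):
                binary = refine(tile)  # fall back to the second stage
            for dy, row in enumerate(binary):
                out[top + dy][left:left + len(row)] = row
    return out
```

Only the tiles flagged by `is_poor` incur the cost of `refine`, which is consistent with the computing-time advantage the abstract reports, although the actual criterion and network used in the paper are not given here.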