A Hybrid Approach to Detect and Localize Texts in Natural Scene Images
- 2 September 2010
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Image Processing
- Vol. 20 (3), 800-813
- https://doi.org/10.1109/tip.2010.2070803
Abstract
Text detection and localization in natural scene images is important for content-based image analysis. This problem is challenging due to the complex background, the non-uniform illumination, the variations of text font, size and line orientation. In this paper, we present a hybrid approach to robustly detect and localize texts in natural scene images. A text region detector is designed to estimate the text existing confidence and scale information in image pyramid, which help segment candidate text components by local binarization. To efficiently filter out the non-text components, a conditional random field (CRF) model considering unary component properties and binary contextual component relationships with supervised parameter learning is proposed. Finally, text components are grouped into text lines/words with a learning-based energy minimization method. Since all the three stages are learning-based, there are very few parameters requiring manual tuning. Experimental results evaluated on the ICDAR 2005 competition dataset show that our approach yields higher precision and recall performance compared with state-of-the-art methods. We also evaluated our approach on a multilingual image dataset with promising results.Keywords
This publication has 37 references indexed in Scilit:
- Regularized margin-based conditional log-likelihood loss for prototype learningPattern Recognition, 2010
- Handwritten Chinese text line segmentation by clustering with distance metric learningPattern Recognition, 2009
- A robust approach to text line grouping in online handwritten Japanese documentsPattern Recognition, 2009
- Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in imagesPattern Recognition, 2008
- Color text extraction with selective metric-based clusteringComputer Vision and Image Understanding, 2007
- A Contour-Based Robust Algorithm for Text Detection in Color ImagesIEICE Transactions on Information and Systems, 2006
- Automatic Detection and Recognition of Signs From Natural ScenesIEEE Transactions on Image Processing, 2004
- Text information extraction in images and video: a surveyPattern Recognition, 2004
- Fast approximate energy minimization via graph cutsIeee Transactions On Pattern Analysis and Machine Intelligence, 2001
- Discriminative learning for minimum error classification (pattern recognition)IEEE Transactions on Signal Processing, 1992