Localizing Text in Scene Images by Boundary Clustering, Stroke Segmentation, and String Fragment Classification
- 15 May 2012
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Image Processing
- Vol. 21 (9), 4256-4268
- https://doi.org/10.1109/tip.2012.2199327
Abstract
In this paper, we propose a novel framework to extract text regions from scene images with complex backgrounds and multiple text appearances. This framework consists of three main steps: boundary clustering (BC), stroke segmentation, and string fragment classification. In BC, we propose a new bigram-color-uniformity-based method to model both text and attachment surface, and cluster edge pixels based on color pairs and spatial positions into boundary layers. Then, stroke segmentation is performed at each boundary layer by color assignment to extract character candidates. We propose two algorithms to combine the structural analysis of text stroke with color assignment and filter out background interferences. Further, we design a robust string fragment classification based on Gabor-based text features. The features are obtained from feature maps of gradient, stroke distribution, and stroke width. The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons. Experimental results on respective datasets demonstrate that the framework outperforms state-of-the-art localization algorithms.
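The boundary clustering step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the clustering procedure, so plain k-means is swapped in here as a stand-in; it is not the authors' algorithm. The names `edge_feature`, `kmeans`, and `spatial_weight`, the rule for ordering the color pair, and the sample pixels are all hypothetical choices made for illustration.

```python
from math import dist

def edge_feature(color_a, color_b, x, y, spatial_weight=0.5):
    """Feature vector for one edge pixel: the colors on both sides of the
    edge plus its weighted position. Ordering the pair by channel sum is
    an assumption, made so the feature does not depend on edge direction."""
    lo, hi = sorted([tuple(color_a), tuple(color_b)], key=sum)
    return (*lo, *hi, spatial_weight * x, spatial_weight * y)

def kmeans(points, k, iters=20):
    """Plain k-means (a stand-in, not the paper's method).
    Initial centers are picked at even intervals through the input list."""
    step = len(points) / k
    centers = [points[int(i * step)] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center.
        labels = [min(range(k), key=lambda c: dist(p, centers[c]))
                  for p in points]
        # Move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(v) / len(members)
                                   for v in zip(*members))
    return labels

# Hypothetical edge pixels: (color on one side, color on the other, x, y).
edge_pixels = [
    ((255, 255, 255), (200, 0, 0), 10, 10),   # white text on a red sign
    ((255, 255, 255), (200, 0, 0), 12, 11),
    ((200, 0, 0), (255, 255, 255), 14, 10),   # same pair, opposite direction
    ((0, 0, 0), (230, 220, 0), 200, 50),      # black text on a yellow sign
    ((0, 0, 0), (230, 220, 0), 202, 52),
    ((230, 220, 0), (0, 0, 0), 204, 50),
]

features = [edge_feature(a, b, x, y) for a, b, x, y in edge_pixels]
labels = kmeans(features, k=2)

# Group pixel positions by cluster label to form "boundary layers".
layers = {}
for (_, _, x, y), lab in zip(edge_pixels, labels):
    layers.setdefault(lab, []).append((x, y))
```

In this toy input the two signs separate into two layers because their color pairs and positions both differ. The number of clusters and the spatial weight are fixed by hand here, which a real system would need to estimate.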