Localizing Text in Scene Images by Boundary Clustering, Stroke Segmentation, and String Fragment Classification

15 May 2012

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Image Processing

Vol. 21 (9), 4256-4268
https://doi.org/10.1109/tip.2012.2199327

Abstract

In this paper, we propose a novel framework to extract text regions from scene images with complex backgrounds and multiple text appearances. This framework consists of three main steps: boundary clustering (BC), stroke segmentation, and string fragment classification. In BC, we propose a new bigram-color-uniformity-based method to model both text and attachment surface, and cluster edge pixels based on color pairs and spatial positions into boundary layers. Then, stroke segmentation is performed at each boundary layer by color assignment to extract character candidates. We propose two algorithms to combine the structural analysis of text stroke with color assignment and filter out background interferences. Further, we design a robust string fragment classification based on Gabor-based text features. The features are obtained from feature maps of gradient, stroke distribution, and stroke width. The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons. Experimental results on respective datasets demonstrate that the framework outperforms state-of-the-art localization algorithms.

Keywords

This publication has 26 references indexed in Scilit:

Text String Detection From Natural Scenes by Structure-Based Partition and Grouping
IEEE Transactions on Image Processing, 2011
Features extraction for text detection and localization
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2010
Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation
Ieee Transactions On Pattern Analysis and Machine Intelligence, 2009
A Laplacian Method for Video Text Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
A New Approach for Overlay Text Detection and Extraction From Complex Video Scene
IEEE Transactions on Image Processing, 2008
ICDAR 2003 robust reading competitions
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
A parallel-line detection algorithm based on HMM decoding
Ieee Transactions On Pattern Analysis and Machine Intelligence, 2005
A comprehensive method for multilingual video text detection, localization, and extraction
IEEE Transactions on Circuits and Systems for Video Technology, 2005
ICDAR 2005 text locating competition results
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Text detection in images based on unsupervised classification of edge-based features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005

Cited by 106 articles