On Combining Multiple Segmentations in Scene Text Recognition

1 August 2013

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 523-527
https://doi.org/10.1109/icdar.2013.110

Abstract

An end-to-end real-time scene text localization and recognition method is presented. The three main novel features are: (i) keeping multiple segmentations of each character until the very last stage of the processing when the context of each character in a text line is known, (ii) an efficient algorithm for selection of character segmentations minimizing a global criterion, and (iii) showing that, despite using theoretically scale-invariant methods, operating on a coarse Gaussian scale space pyramid yields improved results as many typographical artifacts are eliminated. The method runs in real time and achieves state-of-the-art text localization results on the ICDAR 2011 Robust Reading dataset. Results are also reported for end-to-end text recognition on the ICDAR 2011 dataset.
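The abstract does not spell out the global criterion or the selection algorithm in feature (ii), so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the characters of a text line form a chain, that each position has a few candidate segmentations with a hypothetical per-candidate cost (`unary`), and that a hypothetical `pairwise` function scores how well neighbouring candidates fit together. Under those assumptions, one candidate per position can be chosen by dynamic programming so that the summed cost over the whole line is minimal.

```python
from typing import Callable, List, Sequence, Tuple


def select_segmentations(
    unary: Sequence[Sequence[float]],
    pairwise: Callable[[int, int, int], float],
) -> Tuple[float, List[int]]:
    """Choose one candidate per position minimizing the total chain cost.

    unary[i][k]       -- assumed cost of candidate k at position i
    pairwise(i, j, k) -- assumed cost of candidate j at position i being
                         followed by candidate k at position i + 1
    Both cost definitions are illustrative, not taken from the paper.
    """
    if not unary:
        return 0.0, []

    # best[k]: cheapest cost of any selection ending in candidate k.
    best = list(unary[0])
    back: List[List[int]] = []  # back[i-1][k]: best predecessor of k at i

    for i in range(1, len(unary)):
        new_best: List[float] = []
        pointers: List[int] = []
        for k, cost_k in enumerate(unary[i]):
            # Cheapest predecessor for candidate k at position i.
            j_best = min(
                range(len(best)),
                key=lambda j: best[j] + pairwise(i - 1, j, k),
            )
            new_best.append(best[j_best] + pairwise(i - 1, j_best, k) + cost_k)
            pointers.append(j_best)
        best = new_best
        back.append(pointers)

    # Trace the cheapest selection back from the last position.
    k = min(range(len(best)), key=best.__getitem__)
    total = best[k]
    path = [k]
    for pointers in reversed(back):
        k = pointers[k]
        path.append(k)
    path.reverse()
    return total, path
```

With `c` candidates per position and `n` positions, this runs in O(n c^2) time. The point of the sketch is that deferring the choice until the whole line is available lets context override a locally cheaper candidate: a segmentation that looks best in isolation can lose to one that is consistent with its neighbours.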

Cited by 72 articles