On Combining Multiple Segmentations in Scene Text Recognition
- 1 August 2013
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 523-527
- https://doi.org/10.1109/icdar.2013.110
Abstract
An end-to-end real-time scene text localization and recognition method is presented. The three main novel features are: (i) keeping multiple segmentations of each character until the very last stage of the processing when the context of each character in a text line is known, (ii) an efficient algorithm for selection of character segmentations minimizing a global criterion, and (iii) showing that, despite using theoretically scale-invariant methods, operating on a coarse Gaussian scale space pyramid yields improved results as many typographical artifacts are eliminated. The method runs in real time and achieves state-of-the-art text localization results on the ICDAR 2011 Robust Reading dataset. Results are also reported for end-to-end text recognition on the ICDAR 2011 dataset.Keywords
This publication has 13 references indexed in Scilit:
- Scene text detection using graph model built upon maximally stable extremal regionsPattern Recognition Letters, 2013
- ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene ImagesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Text Localization in Real-World Images Using Efficiently Pruned Exhaustive SearchPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Text String Detection From Natural Scenes by Structure-Based Partition and GroupingIEEE Transactions on Image Processing, 2011
- Detecting text in natural scenes with stroke width transformPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- Text Detection and Localization in Complex Scene Images using Constrained AdaBoost AlgorithmPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Text Localization in Natural Scene Images Based on Conditional Random FieldPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Object count/area graphs for the evaluation of object detection and segmentation algorithmsInternational Journal on Document Analysis and Recognition (IJDAR), 2006
- ICDAR 2003 robust reading competitionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Localizing and segmenting text in images and videosIEEE Transactions on Circuits and Systems for Video Technology, 2002