Script independent feature set for handwritten text recognition

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE) in 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

p. 1147-1152
https://doi.org/10.1109/mipro.2014.6859741

Abstract

The efficiency of any character recognition technique is directly dependent on the accuracy of the generated feature set which could uniquely represent a character and hence correctly recognize it. This paper proposes a hybrid approach combining the structural features of the character and a mathematical model of curve fitting to simulate the best features of a character. As a preprocessing step the character is binarized and transformed to a thinned skeleton and the spurious edges are removed. Then, a combination of structural features of the character like number of end points, loops and intersection points are calculated. Further, the thinned character image is statistically zoned into partitions and quadratic curve fitting model is applied on each partition forming a feature vector of coefficients of the curve. This vector is combined with the spatial distribution of the foreground pixels for each zone and hence script independent feature representation. The approach has been evaluated experimentally on English and Hindi scripts. The algorithm achieves as average recognition accuracy of 89% for any script without incorporating any script specific features.

Keywords

This publication has 7 references indexed in Scilit:

A novel feature extraction technique for the recognition of segmented handwritten characters
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2020
Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition
Applied Computational Intelligence and Soft Computing, 2012
Neural network based handwritten character recognition system without feature extraction
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Moment based invariant feature extraction techniques for bilingual character recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2010
Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008
Fuzzy Model Based Recognition of Handwritten Hindi Characters
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
Feature extraction with wavelet transform for recognition of isolated handwritten Farsi/Arabic characters and numerals
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003

Cited by 3 articles