Script independent feature set for handwritten text recognition
- 1 May 2014
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)
- p. 1147-1152
- https://doi.org/10.1109/mipro.2014.6859741
Abstract
The efficiency of any character recognition technique depends directly on the accuracy of the generated feature set, which must uniquely represent a character so that it can be recognized correctly. This paper proposes a hybrid approach that combines the structural features of a character with a curve-fitting mathematical model to capture its most discriminative features. As a preprocessing step, the character is binarized and thinned to a skeleton, and spurious edges are removed. Then structural features of the character, such as the number of end points, loops, and intersection points, are calculated. Next, the thinned character image is statistically zoned into partitions, and a quadratic curve-fitting model is applied to each partition, yielding a feature vector of curve coefficients. This vector is combined with the spatial distribution of the foreground pixels in each zone, giving a script-independent feature representation. The approach has been evaluated experimentally on English and Hindi scripts. The algorithm achieves an average recognition accuracy of 89% on these scripts without incorporating any script-specific features.
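The feature pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how zones are chosen or how end points and intersections are detected, so the uniform grid, the crossing-number test, and all function names (`structural_counts`, `fit_quadratic`, `zone_features`, `feature_vector`) are assumptions made for illustration. Loop counting, binarization, thinning, and the classifier are omitted, and the input is assumed to be an already thinned binary skeleton.

```python
# Binary skeleton image: list of rows, 1 = foreground, 0 = background.
RING = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def pixel(img, r, c):
    """Pixel value, treating everything outside the image as background."""
    return img[r][c] if 0 <= r < len(img) and 0 <= c < len(img[0]) else 0


def crossing_number(img, r, c):
    """Number of 0 -> 1 transitions while walking the 8-neighbour ring."""
    ring = [pixel(img, r + dr, c + dc) for dr, dc in RING]
    return sum(ring[i] == 0 and ring[(i + 1) % 8] == 1 for i in range(8))


def structural_counts(img):
    """Return (end points, intersection points) of a thinned skeleton.

    Assumed heuristic: crossing number 1 marks an end point,
    crossing number 3 or more marks an intersection.
    """
    ends = junctions = 0
    for r, row in enumerate(img):
        for c, value in enumerate(row):
            if value:
                cn = crossing_number(img, r, c)
                ends += cn == 1
                junctions += cn >= 3
    return ends, junctions


def fit_quadratic(points):
    """Least-squares fit of y = a*x^2 + b*x + c; returns (a, b, c)."""
    s = [sum(x ** k for x, _ in points) for k in range(5)]      # sums of x^k
    t = [sum(y * x ** k for x, y in points) for k in range(3)]  # sums of y*x^k
    m = [[s[4], s[3], s[2], t[2]],
         [s[3], s[2], s[1], t[1]],
         [s[2], s[1], s[0], t[0]]]
    # Gauss-Jordan elimination with partial pivoting on the normal equations.
    for i in range(3):
        p = max(range(i, 3), key=lambda k: abs(m[k][i]))
        if abs(m[p][i]) < 1e-9:
            return (0.0, 0.0, 0.0)  # degenerate zone (empty or vertical stroke)
        m[i], m[p] = m[p], m[i]
        for k in range(3):
            if k != i:
                f = m[k][i] / m[i][i]
                m[k] = [u - f * v for u, v in zip(m[k], m[i])]
    return tuple(m[i][3] / m[i][i] for i in range(3))


def zone_features(img, rows=2, cols=2):
    """Per zone: quadratic coefficients plus foreground pixel density."""
    h, w = len(img), len(img[0])
    feats = []
    for zr in range(rows):
        for zc in range(cols):
            r0, r1 = zr * h // rows, (zr + 1) * h // rows
            c0, c1 = zc * w // cols, (zc + 1) * w // cols
            # Foreground pixels as (x, y) in zone-local coordinates.
            pts = [(c - c0, r - r0) for r in range(r0, r1)
                   for c in range(c0, c1) if img[r][c]]
            area = max((r1 - r0) * (c1 - c0), 1)
            feats.extend(fit_quadratic(pts))
            feats.append(len(pts) / area)
    return feats


def feature_vector(img, rows=2, cols=2):
    """Structural counts followed by the per-zone features."""
    return list(structural_counts(img)) + zone_features(img, rows, cols)


# A 5x5 plus-shaped skeleton: four stroke ends and one crossing.
plus = [[1 if r == 2 or c == 2 else 0 for c in range(5)] for r in range(5)]
features = feature_vector(plus)
```

For the plus-shaped skeleton, `structural_counts(plus)` gives 4 end points and 1 intersection point, and `features` holds 2 structural values followed by 4 values (three coefficients and one density) for each of the 4 zones. A zone whose pixels cannot determine a quadratic, for example an empty zone or a purely vertical stroke, is encoded here with zero coefficients; that convention is also an assumption.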