AUTOMATION OF INDIAN POSTAL DOCUMENTS WRITTEN IN BANGLA AND ENGLISH
- 1 December 2009
- journal article
- research article
- Published by World Scientific Pub Co Pte Ltd in International Journal of Pattern Recognition and Artificial Intelligence
- Vol. 23 (08), 1599-1632
- https://doi.org/10.1142/s0218001409007776
Abstract
In this paper, we present a system towards Indian postal automation based on pin-code and city name recognition. Here, at first, using Run Length Smoothing Approach (RLSA), non-text blocks (postal stamp, postal seal, etc.) are detected and using positional information, Destination Address Block (DAB) is identified from postal documents. Next, lines and words of the DAB are segmented. In India, the address part of a postal document may be written by a combination of two scripts: Latin (English) and a local (State/region) script. It is very difficult to identify the script by which pin-code part is written. To overcome this problem on pin-code part, we have used a two-stage artificial neural network based general scheme to recognize pin-code numbers written in any of the two scripts. To identify the script by which a word/city name is written, we propose a water reservoir concept based feature. For recognition of city names, we propose an NSHP-HMM (Non-Symmetric Half Plane-Hidden Markov Model) based technique. At present, the accuracy of the proposed digit numeral recognition module is 93.14% while that of city name recognition scheme is 86.44%.Keywords
This publication has 18 references indexed in Scilit:
- Indian script character recognition: a surveyPattern Recognition, 2004
- Multioriented and Curved Text Lines Extraction From Indian DocumentsIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2004
- Handwritten digit recognition: investigation of normalization and feature extraction techniquesPattern Recognition, 2004
- A HYBRID SCHEME FOR HANDPRINTED NUMERAL RECOGNITION BASED ON A SELF-ORGANIZING NETWORK AND MLP ClASSIFIERSInternational Journal of Pattern Recognition and Artificial Intelligence, 2002
- Cross-learning in analytic word recognition without segmentationInternational Journal on Document Analysis and Recognition (IJDAR), 2002
- Integration of structural and statistical information for unconstrained handwritten numeral recognitionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1999
- An HMM-based approach for off-line unconstrained handwritten word modeling and recognitionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1999
- A complete printed Bangla OCR systemPattern Recognition, 1998
- Bengali alpha-numeric character recognition using curvature featuresPattern Recognition, 1993
- On the relationship of the Markov mesh to the NSHP Markov chainPattern Recognition Letters, 1987