Separating Indic Scripts with matra for Effective Handwritten Script Identification in Multi-Script Documents
- 27 February 2017
- journal article
- research article
- Published by World Scientific Pub Co Pte Ltd in International Journal of Pattern Recognition and Artificial Intelligence
- Vol. 31 (05)
- https://doi.org/10.1142/s0218001417530032
Abstract
We present a novel approach for separating Indic scripts with ‘matra’, which is used as a precursor to advance and/or ease subsequent handwritten script identification in multi-script documents. In our study, among state-of-the-art features and classifiers, an optimized fractal geometry analysis and random forest are found to be the best performer to distinguish scripts with ‘matra’ from their counterparts. For validation, a total of 1204 document images are used, where two different scripts with ‘matra’: Bangla and Devanagari are considered as positive samples and the other two different scripts: Roman and Urdu are considered as negative samples. With this precursor, an overall script identification performance can be advanced by more than 5.13% in accuracy and 1.17 times faster in processing time as compared to conventional system.Keywords
This publication has 11 references indexed in Scilit:
- SOFT-ASSIGNMENT RANDOM-FOREST WITH AN APPLICATION TO DISCRIMINATIVE REPRESENTATION OF HUMAN ACTIONS IN VIDEOSInternational Journal of Pattern Recognition and Artificial Intelligence, 2013
- A System for Handwritten Script Identification From Indian DocumentJournal of Pattern Recognition Research, 2013
- DATASET AND GROUND TRUTH FOR HANDWRITTEN TEXT IN FOUR DIFFERENT SCRIPTSInternational Journal of Pattern Recognition and Artificial Intelligence, 2012
- Fourier Descriptor based Isolated Marathi Handwritten Numeral RecognitionInternational Journal of Computer Applications, 2010
- Script Recognition—A ReviewIEEE Transactions on Pattern Analysis and Machine Intelligence, 2010
- AUTOMATION OF INDIAN POSTAL DOCUMENTS WRITTEN IN BANGLA AND ENGLISHInternational Journal of Pattern Recognition and Artificial Intelligence, 2009
- A NEURAL NETWORK APPROACH TO REAL-TIME PATTERN RECOGNITIONInternational Journal of Pattern Recognition and Artificial Intelligence, 2001
- Script and language identification for handwritten document imagesInternational Journal on Document Analysis and Recognition (IJDAR), 1999
- A Bayesian method for the induction of probabilistic networks from dataMachine Learning, 1992
- A Computational Approach to Edge DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1986