Script identification from handwritten documents using SIFT method
- 1 September 2017
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2017 IEEE International Conference on Power, Control, Signals and Instrumentation Engineering (ICPCSI)
Abstract
Automatic identification of scripts from document images helps selecting appropriate OCR for character recognition and content retrieval. In this paper, Scale invariant Feature Transformation (SIFT) based script identification has been proposed. Features are extracted using SIFT approach at word level (two, three or more character words) and KNN classifier has been used to recognize the script. Experiments are performed by extracting the words from document images consisting of English, Kannada, and Devanagari scripts. Overall accuracy reported for the proposed system is 97.65% and 96.71% for bi-script and tri-scripts, respectively.Keywords
This publication has 9 references indexed in Scilit:
- Directional Discrete Cosine Transform for Handwritten Script IdentificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Script Identification from Handwritten DocumentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- A novel framework for automatic sorting of postal documents with multi-script address blocksPattern Recognition, 2010
- Script identification in a handwritten document image using texture featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- Bangla/English Script Identification Based on Analysis of Connected Component ProfilesLecture Notes in Computer Science, 2006
- Script Identification Based on Morphological Reconstruction in Document ImagesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- Script-based classification of hand-written text documents in a multilingual environmentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Script and language identification for handwritten document imagesInternational Journal on Document Analysis and Recognition (IJDAR), 1999