Use case visual Bag-of-Words techniques for camera based identity document classification
- 1 August 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Nowadays, automatic identity document recognition, including passport and driving license recognition, is at the core of many applications within the administrative and service sectors, such as police, hospitality, car renting, etc. In former years, the document information was manually extracted whereas today this data is recognized automatically from images obtained by flat-bed scanners. Yet, since these scanners tend to be expensive and voluminous, companies in the sector have recently turned their attention to cheaper, small and yet computationally powerful scanners: the mobile devices. The document identity recognition from mobile images enclose several new difficulties w.r.t traditional scanned images, such as the loss of a controlled background, perspective, blurring, etc. In this paper we present a real application for identity document classification of images taken from mobile devices. This classification process is of extreme importance since a prior knowledge of the document type and origin strongly facilitates the subsequent information extraction. The proposed method is based on a traditional Bagof-Words in which we have taken into consideration several key aspects to enhance recognition rate. The method performance has been studied on three datasets containing more than 2000 images from 129 different document classes.Keywords
This publication has 12 references indexed in Scilit:
- Document Classification and Page Stream Segmentation for Digital Mailroom ApplicationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Real-time scene text localization and recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Content-based binary image retrieval using the adaptive hierarchical density histogramPattern Recognition, 2011
- Document Image Retrieval with Local Feature SequencesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Speeded-Up Robust Features (SURF)Computer Vision and Image Understanding, 2008
- Document Images Retrieval Based on Multiple Features CombinationNinth International Conference on Document Analysis and Recognition (ICDAR 2007), 2007
- Rapid object detection using a boosted cascade of simple featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Fine-grained document genre classification using first order random graphsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A re-examination of text categorization methodsPublished by Association for Computing Machinery (ACM) ,1999
- Classification method study for automatic form class identificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1998