MIDV-500: a dataset for identity document analysis and recognition on mobile devices in video stream

Open Access

1 October 2019

journal article
research article
Published by Samara National Research University in Computer Optics

Vol. 43 (5), 818-824
https://doi.org/10.18287/2412-6179-2019-43-5-818-824

Abstract

Much research has been devoted to identity document analysis and recognition on mobile devices. However, no publicly available datasets designed for this particular problem currently exist. A few datasets are useful for associated subtasks, but more specialized datasets are required to facilitate a more comprehensive scientific and technical approach to identity document recognition. In this paper we present the Mobile Identity Document Video dataset (MIDV-500), consisting of 500 video clips for 50 different identity document types with ground truth, which enables research on a wide range of document analysis problems. The paper presents the characteristics of the dataset. Because identity documents are sensitive, as they contain personal data, all source document images used in MIDV-500 are either in the public domain or distributed under public copyright licenses. The main goal of this paper is to present the dataset; in addition, as a baseline, we present evaluation results on it for existing methods of face detection, text line recognition, and document field data extraction.

Funding Information

Russian Foundation for Basic Research (17-29-03170, 17-29-03370)

This publication has 26 references indexed in Scilit.

Cited by 57 articles