SADG: Self-Aligned Dual NIR-VIS Generation for Heterogeneous Face Recognition

Open Access

22 January 2021

journal article
research article
Published by MDPI AG in Applied Sciences

Vol. 11 (3), 987
https://doi.org/10.3390/app11030987

Abstract

Heterogeneous face recognition (HFR) has aroused significant interest in recent years, with some challenging tasks such as misalignment problems and limited HFR data. Misalignment occurs among different modalities’ images mainly because of misaligned semantics. Although recent methods have attempted to settle the low-shot problem, they suffer from the misalignment problem between paired near infrared (NIR) and visible (VIS) images. Misalignment can bring performance degradation to most image-to-image translation networks. In this work, we propose a self-aligned dual generation (SADG) architecture for generating semantics-aligned pairwise NIR-VIS images with the same identity, but without the additional guidance of external information learning. Specifically, we propose a self-aligned generator to align the data distributions between two modalities. Then, we present a multiscale patch discriminator to get high quality images. Furthermore, we raise the mean landmark distance (MLD) to test the alignment performance between NIR and VIS images with the same identity. Extensive experiments and an ablation study of SADG on three public datasets show significant alignment performance and recognition results. Specifically, the Rank1 accuracy achieved was close to 99.9% for the CASIA NIR-VIS 2.0, Oulu-CASIA NIR-VIS and BUAA VIS-NIR datasets, respectively.

This publication has 22 references indexed in Scilit:

Heterogeneous Face Recognition by Margin-Based Cross-Modality Metric Learning
IEEE Transactions on Cybernetics, 2018
Image-to-Image Translation with Conditional Adversarial Networks
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2017
Deep Perceptual Mapping for Cross-Modal Face Recognition
International Journal of Computer Vision, 2016
Multi-task clustering ELM for VIS-NIR cross-modal feature learning
Multidimensional Systems and Signal Processing, 2016
Multi-View Discriminant Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015
Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
The CASIA NIR-VIS 2.0 Face Database
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Deep Convolutional Network Cascade for Facial Point Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Coupled Discriminant Analysis for Heterogeneous Face Recognition
IEEE Transactions on Information Forensics and Security, 2012
Gradient-based learning applied to document recognition
Proceedings of the IEEE, 1998

Cited by 2 articles