SADG: Self-Aligned Dual NIR-VIS Generation for Heterogeneous Face Recognition
Open Access
- 22 January 2021
- journal article
- research article
- Published by MDPI AG in Applied Sciences
- Vol. 11 (3), 987
- https://doi.org/10.3390/app11030987
Abstract
Heterogeneous face recognition (HFR) has aroused significant interest in recent years, with some challenging tasks such as misalignment problems and limited HFR data. Misalignment occurs among different modalities’ images mainly because of misaligned semantics. Although recent methods have attempted to settle the low-shot problem, they suffer from the misalignment problem between paired near infrared (NIR) and visible (VIS) images. Misalignment can bring performance degradation to most image-to-image translation networks. In this work, we propose a self-aligned dual generation (SADG) architecture for generating semantics-aligned pairwise NIR-VIS images with the same identity, but without the additional guidance of external information learning. Specifically, we propose a self-aligned generator to align the data distributions between two modalities. Then, we present a multiscale patch discriminator to get high quality images. Furthermore, we raise the mean landmark distance (MLD) to test the alignment performance between NIR and VIS images with the same identity. Extensive experiments and an ablation study of SADG on three public datasets show significant alignment performance and recognition results. Specifically, the Rank1 accuracy achieved was close to 99.9% for the CASIA NIR-VIS 2.0, Oulu-CASIA NIR-VIS and BUAA VIS-NIR datasets, respectively.This publication has 22 references indexed in Scilit:
- Heterogeneous Face Recognition by Margin-Based Cross-Modality Metric LearningIEEE Transactions on Cybernetics, 2018
- Image-to-Image Translation with Conditional Adversarial NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2017
- Deep Perceptual Mapping for Cross-Modal Face RecognitionInternational Journal of Computer Vision, 2016
- Multi-task clustering ELM for VIS-NIR cross-modal feature learningMultidimensional Systems and Signal Processing, 2016
- Multi-View Discriminant AnalysisIEEE Transactions on Pattern Analysis and Machine Intelligence, 2015
- Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network CascadePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- The CASIA NIR-VIS 2.0 Face DatabasePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Deep Convolutional Network Cascade for Facial Point DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Coupled Discriminant Analysis for Heterogeneous Face RecognitionIEEE Transactions on Information Forensics and Security, 2012
- Gradient-based learning applied to document recognitionProceedings of the IEEE, 1998