Cascade Regression-Based Face Frontalization for Dynamic Facial Expression Analysis

Open Access

10 February 2021

journal article
research article
Published by Springer Science and Business Media LLC in Cognitive Computation

Vol. 14 (5), 1571-1584
https://doi.org/10.1007/s12559-021-09843-8

Abstract

Facial expression recognition has seen rapid development in recent years due to its wide range of applications such as human–computer interaction, health care, and social robots. Although significant progress has been made in this field, it is still challenging to recognize facial expressions with occlusions and large head-poses. To address these issues, this paper presents a cascade regression-based face frontalization (CRFF) method, which aims to immediately reconstruct a clean, frontal and expression-aware face given an in-the-wild facial image. In the first stage, a frontal facial shape is predicted by developing a cascade regression model to learn the pairwise spatial relation between non-frontal face-shape and its frontal counterpart. Unlike most existing shape prediction methods that used single-step regression, the cascade model is a multi-step regressor that gradually aligns non-frontal shape to its frontal view. We employ several different regressors and make a ensemble decision to boost prediction performance. For facial texture reconstruction, active appearance model instantiation is employed to warp the input face to the predicted frontal shape and generate a clean face. To remove occlusions, we train this generative model on manually selected clean-face sets, which ensures generating a clean face as output regardless of whether the input face involves occlusions or not. Unlike the existing face reconstruction methods that are computational expensive, the proposed method works in real time, so it is suitable for dynamic analysis of facial expression. The experimental validation shows that the ensembling cascade model has improved frontal shape prediction accuracy for an average of 5% and the proposed method has achieved superior performance on both static and dynamic recognition of facial expressions over the state-of-the-art approaches. The experimental results demonstrate that the proposed method has achieved expression-preserving frontalization, de-occlusion and has improved performance of facial expression recognition.

Keywords

Funding Information

Engineering and Physical Sciences Research Council (EP/N025849/1)

This publication has 46 references indexed in Scilit:

Automatic Analysis of Facial Affect: A Survey of Registration, Representation, and Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014
An Accurate Algorithm for Generating a Music Playlist based on Facial Expressions
International Journal of Computer Applications, 2014
Structure-Preserving Sparse Decomposition for Facial Expression Analysis
IEEE Transactions on Image Processing, 2014
A Dynamic Appearance Descriptor Approach to Facial Actions Temporal Modeling
IEEE Transactions on Cybernetics, 2013
Multi-view Facial Expression Recognition Analysis with Generic Sparse Coding Feature
Lecture Notes in Computer Science, 2012
Facial expression recognition based on Local Binary Patterns: A comprehensive study
Image and Vision Computing, 2009
Human Action Recognition Using LBP-TOP as Sparse Spatio-Temporal Feature Descriptor
Lecture Notes in Computer Science, 2009
Active Appearance Models Revisited
International Journal of Computer Vision, 2004
Nonrigid registration using free-form deformations: application to breast MR images
IEEE Transactions on Medical Imaging, 1999
Active appearance models
Lecture Notes in Computer Science, 1998

Cited by 15 articles