Cascade Regression-Based Face Frontalization for Dynamic Facial Expression Analysis
Open Access
- 10 February 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Cognitive Computation
- Vol. 14 (5), 1571-1584
- https://doi.org/10.1007/s12559-021-09843-8
Abstract
Facial expression recognition has seen rapid development in recent years due to its wide range of applications such as human–computer interaction, health care, and social robots. Although significant progress has been made in this field, it is still challenging to recognize facial expressions with occlusions and large head-poses. To address these issues, this paper presents a cascade regression-based face frontalization (CRFF) method, which aims to immediately reconstruct a clean, frontal and expression-aware face given an in-the-wild facial image. In the first stage, a frontal facial shape is predicted by developing a cascade regression model to learn the pairwise spatial relation between non-frontal face-shape and its frontal counterpart. Unlike most existing shape prediction methods that used single-step regression, the cascade model is a multi-step regressor that gradually aligns non-frontal shape to its frontal view. We employ several different regressors and make a ensemble decision to boost prediction performance. For facial texture reconstruction, active appearance model instantiation is employed to warp the input face to the predicted frontal shape and generate a clean face. To remove occlusions, we train this generative model on manually selected clean-face sets, which ensures generating a clean face as output regardless of whether the input face involves occlusions or not. Unlike the existing face reconstruction methods that are computational expensive, the proposed method works in real time, so it is suitable for dynamic analysis of facial expression. The experimental validation shows that the ensembling cascade model has improved frontal shape prediction accuracy for an average of 5% and the proposed method has achieved superior performance on both static and dynamic recognition of facial expressions over the state-of-the-art approaches. The experimental results demonstrate that the proposed method has achieved expression-preserving frontalization, de-occlusion and has improved performance of facial expression recognition.Keywords
Funding Information
- Engineering and Physical Sciences Research Council (EP/N025849/1)
This publication has 46 references indexed in Scilit:
- Automatic Analysis of Facial Affect: A Survey of Registration, Representation, and RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence, 2014
- An Accurate Algorithm for Generating a Music Playlist based on Facial ExpressionsInternational Journal of Computer Applications, 2014
- Structure-Preserving Sparse Decomposition for Facial Expression AnalysisIEEE Transactions on Image Processing, 2014
- A Dynamic Appearance Descriptor Approach to Facial Actions Temporal ModelingIEEE Transactions on Cybernetics, 2013
- Multi-view Facial Expression Recognition Analysis with Generic Sparse Coding FeatureLecture Notes in Computer Science, 2012
- Facial expression recognition based on Local Binary Patterns: A comprehensive studyImage and Vision Computing, 2009
- Human Action Recognition Using LBP-TOP as Sparse Spatio-Temporal Feature DescriptorLecture Notes in Computer Science, 2009
- Active Appearance Models RevisitedInternational Journal of Computer Vision, 2004
- Nonrigid registration using free-form deformations: application to breast MR imagesIEEE Transactions on Medical Imaging, 1999
- Active appearance modelsLecture Notes in Computer Science, 1998