Deep Convolutional Network Cascade for Facial Point Detection

1 June 2013

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 3476-3483
https://doi.org/10.1109/cvpr.2013.446

Abstract

We propose a new approach for estimation of the positions of facial key points with three-level carefully designed convolutional networks. At each level, the outputs of multiple networks are fused for robust and accurate estimation. Thanks to the deep structures of convolutional networks, global high-level features are extracted over the whole face region at the initialization stage, which help to locate high accuracy key points. There are two folds of advantage for this. First, the texture context information over the entire face is utilized to locate each key point. Second, since the networks are trained to predict all the key points simultaneously, the geometric constraints among key points are implicitly encoded. The method therefore can avoid local minimum caused by ambiguity and data corruption in difficult image samples due to occlusions, large pose variations, and extreme lightings. The networks at the following two levels are trained to locally refine initial predictions and their inputs are limited to small regions around the initial predictions. Several network structures critical for accurate and robust facial point detection are investigated. Extensive experiments show that our approach outperforms state-of-the-art methods in both detection accuracy and reliability.

Keywords

This publication has 11 references indexed in Scilit:

Learning Hierarchical Features for Scene Labeling
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012
Learning hierarchical representations for face verification with convolutional deep belief networks
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Multi-column deep neural networks for image classification
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Real-time facial feature detection using conditional regression forests
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Optimal landmark detection using shape models and branch and bound
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Accurate Regression Procedures for Active Appearance Models
Published by British Machine Vision Association and Society for Pattern Recognition ,2011
Facial point detection using boosted regression and graph models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2010
What is the best multi-stage architecture for object recognition?
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Generic Face Alignment using Boosted Appearance Model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
Efficient BackProp
Published by Springer Science and Business Media LLC ,1998

Cited by 988 articles