Deep Convolutional Network Cascade for Facial Point Detection
- 1 June 2013
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 3476-3483
- https://doi.org/10.1109/cvpr.2013.446
Abstract
We propose a new approach for estimating the positions of facial key points with three levels of carefully designed convolutional networks. At each level, the outputs of multiple networks are fused for robust and accurate estimation. Thanks to the deep structures of convolutional networks, global high-level features are extracted over the whole face region at the initialization stage, which helps to locate key points with high accuracy. This has two advantages. First, the texture context information over the entire face is utilized to locate each key point. Second, since the networks are trained to predict all the key points simultaneously, the geometric constraints among key points are implicitly encoded. The method can therefore avoid local minima caused by ambiguity and data corruption in difficult image samples due to occlusions, large pose variations, and extreme lighting. The networks at the following two levels are trained to locally refine the initial predictions, and their inputs are limited to small regions around the initial predictions. Several network structures critical for accurate and robust facial point detection are investigated. Extensive experiments show that our approach outperforms state-of-the-art methods in both detection accuracy and reliability.
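The control flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the networks are replaced by hypothetical callables, "fusion" is assumed to mean a plain average, and each refinement network is assumed to crop a small region around the current estimate and return a small (dx, dy) correction. The names `average_points` and `cascade_predict` are invented for this illustration.

```python
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, float]

# Hypothetical stand-ins for trained networks (assumptions, not from the paper):
# a GlobalNet sees the whole face and predicts every key point at once;
# a LocalNet sees a small region around one estimate and returns a (dx, dy) offset.
GlobalNet = Callable[[object], List[Point]]
LocalNet = Callable[[object, Point], Point]


def average_points(predictions: Sequence[List[Point]]) -> List[Point]:
    """Fuse several full sets of key points by averaging (assumed fusion rule)."""
    n = len(predictions)
    num_points = len(predictions[0])
    return [
        (
            sum(pred[i][0] for pred in predictions) / n,
            sum(pred[i][1] for pred in predictions) / n,
        )
        for i in range(num_points)
    ]


def cascade_predict(
    image: object,
    global_nets: Sequence[GlobalNet],
    refine_levels: Sequence[Sequence[Sequence[LocalNet]]],
) -> List[Point]:
    """Run level 1 on the whole face, then refine each point locally.

    refine_levels[level][k] holds the local networks for key point k
    at that refinement level.
    """
    # Level 1: every network predicts all key points jointly; outputs are fused.
    points = average_points([net(image) for net in global_nets])

    # Later levels: each key point is refined independently from a local view.
    for level_nets in refine_levels:
        refined = []
        for (x, y), nets in zip(points, level_nets):
            offsets = [net(image, (x, y)) for net in nets]
            dx = sum(o[0] for o in offsets) / len(offsets)
            dy = sum(o[1] for o in offsets) / len(offsets)
            refined.append((x + dx, y + dy))
        points = refined
    return points
```

Passing two entries in `refine_levels` corresponds to the three-level arrangement in the abstract: one global initialization followed by two local refinement passes.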