Multi-source Deep Learning for Human Pose Estimation
Top Cited Papers
- 1 June 2014
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 2337-2344
- https://doi.org/10.1109/cvpr.2014.299
Abstract
Visual appearance score, appearance mixture type and deformation are three important information sources for human pose estimation. This paper proposes to build a multi-source deep model in order to extract non-linear representation from these different aspects of information sources. With the deep model, the global, high-order human body articulation patterns in these information sources are extracted for pose estimation. The task for estimating body locations and the task for human detection are jointly learned using a unified deep model. The proposed approach can be viewed as a post-processing of pose estimation results and can flexibly integrate with existing methods by taking their information sources as input. By extracting the non-linear representation from multiple information sources, the deep model outperforms state-of-the-art by up to 8.6 percent on three public benchmark datasets.Keywords
This publication has 41 references indexed in Scilit:
- Joint Deep Learning for Pedestrian DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Strong Appearance and Expressive Spatial Models for Human Pose EstimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Modeling Mutual Visibility Relationship in Pedestrian DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Articulated people detection and pose estimation: Reshaping the futurePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Sum-product networks: A new deep architecturePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Learning effective human pose estimation from inaccurate annotationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Stacks of convolutional Restricted Boltzmann Machines for shift-invariant feature learningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Recovering 3D human body configurations using shape contextsIEEE Transactions on Pattern Analysis and Machine Intelligence, 2006
- Factor graphs and the sum-product algorithmIEEE Transactions on Information Theory, 2001
- Gradient-based learning applied to document recognitionProceedings of the IEEE, 1998