Multi-source Deep Learning for Human Pose Estimation

Top Cited Papers

1 June 2014

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 2337-2344
https://doi.org/10.1109/cvpr.2014.299

Abstract

Visual appearance score, appearance mixture type and deformation are three important information sources for human pose estimation. This paper proposes to build a multi-source deep model in order to extract non-linear representation from these different aspects of information sources. With the deep model, the global, high-order human body articulation patterns in these information sources are extracted for pose estimation. The task for estimating body locations and the task for human detection are jointly learned using a unified deep model. The proposed approach can be viewed as a post-processing of pose estimation results and can flexibly integrate with existing methods by taking their information sources as input. By extracting the non-linear representation from multiple information sources, the deep model outperforms state-of-the-art by up to 8.6 percent on three public benchmark datasets.

Keywords

This publication has 41 references indexed in Scilit:

Joint Deep Learning for Pedestrian Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Strong Appearance and Expressive Spatial Models for Human Pose Estimation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Modeling Mutual Visibility Relationship in Pedestrian Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Articulated people detection and pose estimation: Reshaping the future
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Sum-product networks: A new deep architecture
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Learning effective human pose estimation from inaccurate annotation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Stacks of convolutional Restricted Boltzmann Machines for shift-invariant feature learning
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Recovering 3D human body configurations using shape contexts
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006
Factor graphs and the sum-product algorithm
IEEE Transactions on Information Theory, 2001
Gradient-based learning applied to document recognition
Proceedings of the IEEE, 1998

Cited by 189 articles