Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views
- 1 June 2019
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- p. 7784-7793
- https://doi.org/10.1109/cvpr.2019.00798
Abstract
This paper addresses the problem of 3D pose estimation for multiple people from a few calibrated camera views. The main challenge is to find the cross-view correspondences among noisy and incomplete 2D pose predictions. Most previous methods address this challenge by reasoning directly in 3D with a pictorial structure model, which is inefficient because of the huge state space. We propose a fast and robust approach to this problem. Our key idea is to use a multi-way matching algorithm to cluster the detected 2D poses in all views. Each resulting cluster encodes the 2D poses of the same person across different views, with consistent correspondences across the keypoints, from which the 3D pose of each person can be effectively inferred. The proposed multi-way matching algorithm, based on convex optimization, is efficient and robust against missing and false detections, and does not require knowing the number of people in the scene. Moreover, we propose to combine geometric and appearance cues for cross-view matching. The proposed approach achieves significant performance gains over the state of the art (96.3% vs. 90.6% and 96.9% vs. 88% on the Campus and Shelf datasets, respectively), while being efficient enough for real-time applications.
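Once the 2D poses of one person have been grouped across views, each keypoint can be lifted to 3D from its matched 2D observations. The following is a minimal sketch of standard linear (DLT) triangulation for a single keypoint, given here only as an illustration of that final step and not as the paper's own implementation. The function names, the camera matrices, and the test point are all hypothetical.

```python
import numpy as np

def triangulate_point(projections, points_2d):
    """Linear (DLT) triangulation of one keypoint seen in several calibrated views.

    projections: list of 3x4 camera projection matrices (assumed known).
    points_2d:   list of (x, y) pixel coordinates, one per view.
    """
    rows = []
    for P, (x, y) in zip(projections, points_2d):
        P = np.asarray(P, dtype=float)
        # Each view contributes two linear constraints on the homogeneous 3D point.
        rows.append(x * P[2] - P[0])
        rows.append(y * P[2] - P[1])
    A = np.stack(rows)
    # The solution is the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(A)
    X = vt[-1]
    return X[:3] / X[3]

def project(P, X):
    """Project a 3D point into a view with projection matrix P."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

# Two hypothetical cameras with shared intrinsics, the second shifted along x.
K = np.array([[1000.0, 0.0, 320.0],
              [0.0, 1000.0, 240.0],
              [0.0, 0.0, 1.0]])
P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = K @ np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])

X_true = np.array([0.2, -0.1, 4.0])
X_est = triangulate_point([P1, P2], [project(P1, X_true), project(P2, X_true)])
```

With noise-free observations, X_est recovers X_true up to numerical precision. With noisy or missing detections, views would need to be weighted or dropped, which this sketch does not attempt.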