Resolving Ambiguous Hand Pose Predictions by Exploiting Part Correlations

Abstract
The positions of the hand joints are important high-level features for hand-based human-computer interaction. We present a novel method to predict the 3-D joint positions from depth images and the parsed hand parts obtained with a pretrained classifier. The hand parts are utilized as an additional cue to resolve the multimodal predictions produced by previous regression-based methods, without significantly increasing the computational cost. In addition, we enforce hand motion constraints to fuse the per-pixel prediction results. The posterior distribution of the joints is formulated as a weighted product-of-experts model over the individual pixel predictions, which is maximized via the expectation-maximization algorithm on a learned low-dimensional space of the hand joint parameters. The experimental results show that the proposed method improves prediction accuracy considerably compared with rival methods that also regress joint locations from depth images. In particular, we show that a regressor learned on a synthesized dataset also gives accurate predictions on real-world depth images when the hand part correlations are enforced, despite the discrepancy between the two domains.
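The fusion step described above can be illustrated with a small sketch. This is not the authors' implementation; it assumes a hypothetical setup in which each pixel regresses a full joint-position vector, the pose lies in a PCA-style low-dimensional subspace (`mu`, `P`), and per-pixel weights (standing in for the hand-part consistency cue) are updated in an EM-like loop that alternates soft re-weighting with a weighted least-squares fit in the subspace:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: J joints in 3-D, flattened to D = 3*J values,
# with a K-dimensional learned pose subspace.
J, D, K = 6, 18, 4

# Assumed low-dimensional hand pose model (e.g. from PCA on training poses).
mu = rng.normal(size=D)                         # mean pose
P = np.linalg.qr(rng.normal(size=(D, K)))[0]    # orthonormal subspace basis

# Simulated per-pixel joint predictions, as a regression forest might produce.
N = 200
true_alpha = rng.normal(size=K)
truth = mu + P @ true_alpha
Y = truth + 0.1 * rng.normal(size=(N, D))       # N noisy per-pixel predictions

# EM-style fusion: E-step softly down-weights pixels whose predictions
# disagree with the current pose estimate (a stand-in for the part cue);
# M-step refits the subspace coefficients by a weighted projection.
alpha = np.zeros(K)
for _ in range(10):
    pose = mu + P @ alpha
    resid = np.linalg.norm(Y - pose, axis=1)
    w = np.exp(-resid**2 / (2 * np.median(resid)**2 + 1e-9))  # soft weights
    ybar = (w[:, None] * Y).sum(axis=0) / w.sum()             # weighted mean
    alpha = P.T @ (ybar - mu)                                 # project onto subspace

fused = mu + P @ alpha                          # fused joint positions
```

Constraining the solution to the learned subspace is what keeps the fused pose kinematically plausible even when individual pixel predictions are multimodal or noisy.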
Funding Information
  • National Research Foundation Singapore, through the International Research Centre in Singapore Funding Initiative, administered by the IDM Programme Office; the work was carried out at the BeingThere Centre, Institute of Media Innovation

This publication has 28 references indexed in Scilit.