Viewpoints and keypoints
- 1 June 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 1510-1519
- https://doi.org/10.1109/cvpr.2015.7298758
Abstract
We characterize the problem of pose estimation for rigid objects in terms of determining viewpoint to explain coarse pose and keypoint prediction to capture the finer details. We address both these tasks in two different settings - the constrained setting with known bounding boxes and the more challenging detection setting where the aim is to simultaneously detect and correctly estimate pose of objects. We present Convolutional Neural Network based architectures for these and demonstrate that leveraging viewpoint estimates can substantially improve local appearance based keypoint predictions. In addition to achieving significant improvements over state-of-the-art in the above tasks, we analyze the error modes and effect of object characteristics on performance to guide future efforts towards this goal.Keywords
Other Versions
This publication has 24 references indexed in Scilit:
- Rich Feature Hierarchies for Accurate Object Detection and Semantic SegmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- DeepPose: Human Pose Estimation via Deep Neural NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- Beyond PASCAL: A benchmark for 3D object detection in the wildPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- Is 2D Information Enough For Viewpoint Estimation?Published by British Machine Vision Association and Society for Pattern Recognition ,2014
- Teaching 3D geometry to deformable part modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Viewpoint-aware object detection and pose estimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Articulated pose estimation with flexible mixtures-of-partsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- 3D generic object categorization, localization and pose estimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Model-based vision: a program to see a walking personImage and Vision Computing, 1983
- Perception of wholes and of their component parts: Some configural superiority effects.Journal of Experimental Psychology: Human Perception and Performance, 1977