A 3D object detection and pose estimation pipeline using RGB-D images

Abstract

3D object detection and pose estimation have been studied extensively in recent decades for their potential applications in robotics. However, challenges remain in detecting multiple objects while keeping the false positive rate low in cluttered environments. This paper proposes a robust 3D object detection and pose estimation pipeline based on RGB-D images that detects multiple objects simultaneously while reducing false positives. Detection begins with template matching, which yields a set of template matches. A clustering algorithm then groups templates with similar spatial locations and produces multiple object hypotheses. A scoring function evaluates the hypotheses using their associated templates, and non-maximum suppression removes duplicate results based on the scores. Finally, a combination of point cloud processing algorithms is used to compute the objects' 3D poses. Existing object hypotheses are verified by computing the overlap between model and scene points. Experiments demonstrate that our approach provides results competitive with the state of the art and can be applied to robotic random bin picking.
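The hypothesis-generation and verification stages described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the clustering algorithm, the scoring function, or the overlap criterion, so everything here is an assumption made for illustration: greedy radius-based grouping stands in for the clustering step, the sum of template similarities stands in for the scoring function, and a brute-force nearest-point check stands in for the model-scene overlap. The names `Match`, `cluster_matches`, `make_hypothesis`, `non_maximum_suppression`, and `overlap_ratio` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Match:
    """One template match (hypothetical structure, not from the paper)."""
    x: float           # image column of the matched template centre, in pixels
    y: float           # image row of the matched template centre, in pixels
    similarity: float  # template similarity, higher is better


def cluster_matches(matches, radius):
    """Greedily group matches whose centres lie within `radius` of a cluster seed.

    Assumed stand-in for the paper's clustering step. Matches are visited in
    order of decreasing similarity, so each cluster's seed is its best match.
    """
    clusters = []
    for m in sorted(matches, key=lambda m: m.similarity, reverse=True):
        for c in clusters:
            if dist((m.x, m.y), (c[0].x, c[0].y)) <= radius:
                c.append(m)
                break
        else:
            clusters.append([m])
    return clusters


def make_hypothesis(cluster):
    """Return (centre, score) for a cluster; the score is an assumed sum of similarities."""
    n = len(cluster)
    centre = (sum(m.x for m in cluster) / n, sum(m.y for m in cluster) / n)
    score = sum(m.similarity for m in cluster)
    return centre, score


def non_maximum_suppression(hypotheses, min_separation):
    """Keep the best-scoring hypothesis in each neighbourhood, dropping nearby duplicates."""
    kept = []
    for centre, score in sorted(hypotheses, key=lambda h: h[1], reverse=True):
        if all(dist(centre, k[0]) > min_separation for k in kept):
            kept.append((centre, score))
    return kept


def overlap_ratio(model_points, scene_points, tol):
    """Fraction of posed model points that have a scene point within `tol`.

    Brute force for clarity; a spatial index such as a k-d tree would be the
    usual choice for real point clouds.
    """
    if not model_points:
        return 0.0
    hits = sum(
        1 for p in model_points
        if any(dist(p, q) <= tol for q in scene_points)
    )
    return hits / len(model_points)
```

In a sketch like this, a hypothesis would be accepted only if its `overlap_ratio` after pose refinement exceeds a chosen threshold. The threshold, the clustering radius, and the suppression distance are tuning parameters that the abstract does not report.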

