RGB-(D) scene labeling: Features and algorithms

Abstract

Scene labeling research has mostly focused on outdoor scenes, leaving the harder case of indoor scenes poorly understood. The Microsoft Kinect has dramatically changed the landscape, showing great potential for RGB-D (color+depth) perception. Our main objective is to empirically understand the promises and challenges of scene labeling with RGB-D. We use the NYU Depth Dataset as collected and analyzed by Silberman and Fergus [30]. For RGB-D features, we adapt the framework of kernel descriptors, which converts local similarities (kernels) into patch descriptors. For contextual modeling, we combine two lines of approaches, one using a superpixel MRF and the other using a segmentation tree. We find that (1) kernel descriptors are very effective in capturing appearance (RGB) and shape (D) similarities; (2) both the superpixel MRF and the segmentation tree are useful in modeling context; and (3) the key to labeling accuracy is the ability to efficiently train and test with large-scale data. We improve labeling accuracy on the NYU Dataset from 56.6% to 76.1%. We also apply our approach to image-only scene labeling and improve the accuracy on the Stanford Background Dataset from 79.4% to 82.9%.
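
To make the idea of contextual smoothing with a superpixel MRF concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not specify the energy function or the inference method, so the Potts pairwise term, the iterated conditional modes (ICM) solver, and the names `icm_potts`, `unary`, `edges`, and `beta` are all assumptions made for illustration.

```python
import numpy as np

def icm_potts(unary, edges, beta=1.0, n_iters=10):
    """Approximate MAP labeling of a superpixel graph (illustrative only).

    unary: (n_superpixels, n_labels) array of per-superpixel costs, lower is better
           (assumed to come from some local classifier).
    edges: iterable of (i, j) index pairs for adjacent superpixels.
    beta:  assumed Potts penalty paid when two neighbors take different labels.
    """
    unary = np.asarray(unary, dtype=float)
    n = unary.shape[0]
    neighbors = [[] for _ in range(n)]
    for i, j in edges:
        neighbors[i].append(j)
        neighbors[j].append(i)

    # Start from the independent, context-free decision for each superpixel.
    labels = unary.argmin(axis=1)
    for _ in range(n_iters):
        changed = False
        for i in range(n):
            cost = unary[i].copy()
            for j in neighbors[i]:
                # Potts term: every label except the neighbor's current one pays beta.
                cost += beta
                cost[labels[j]] -= beta
            best = int(cost.argmin())
            if best != labels[i]:
                labels[i] = best
                changed = True
        if not changed:
            break  # reached a local minimum of the energy
    return labels

# Three superpixels in a chain; the middle one weakly prefers label 1.
unary = [[0.0, 1.0], [0.6, 0.4], [0.0, 1.0]]
edges = [(0, 1), (1, 2)]
smoothed = icm_potts(unary, edges, beta=0.5)
independent = icm_potts(unary, edges, beta=0.0)
```

With `beta=0.0` each superpixel keeps its own best label; with a positive `beta` the weakly supported middle superpixel is pulled toward its neighbors. ICM only finds a local minimum, so it is a stand-in for whatever inference the paper actually uses.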

This publication has 20 references indexed in Scilit:

KinectFusion
Published by Association for Computing Machinery (ACM), 2011
A large-scale hierarchical multi-view RGB-D object dataset
Published by Institute of Electrical and Electronics Engineers (IEEE), 2011
Efficiently selecting regions for scene understanding
Published by Institute of Electrical and Electronics Engineers (IEEE), 2010
Recovering the spatial layout of cluttered rooms
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Associative hierarchical CRFs for object class image segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Context by region ancestry
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Recognizing indoor scenes
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Geometric reasoning for single image structure recovery
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Multiple Class Segmentation Using A Unified Framework over Mean-Shift Patches
Published by Institute of Electrical and Electronics Engineers (IEEE), 2007
Geometric context from a single image
Published by Institute of Electrical and Electronics Engineers (IEEE), 2005

Cited by 12 articles