Beyond PASCAL: A benchmark for 3D object detection in the wild

Top Cited Papers

1 March 2014

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15505790,p. 75-82
https://doi.org/10.1109/wacv.2014.6836101

Abstract

3D object detection and pose estimation methods have become popular in recent years since they can handle ambiguities in 2D images and also provide a richer description for objects compared to 2D object detectors. However, most of the datasets for 3D recognition are limited to a small amount of images per category or are captured in controlled environments. In this paper, we contribute PASCAL3D+ dataset, which is a novel and challenging dataset for 3D object detection and pose estimation. PASCAL3D+ augments 12 rigid categories of the PASCAL VOC 2012 [4] with 3D annotations. Furthermore, more images are added for each category from ImageNet [3]. PASCAL3D+ images exhibit much more variability compared to the existing 3D datasets, and on average there are more than 3,000 object instances per category. We believe this dataset will provide a rich testbed to study 3D detection and pose estimation and will help to significantly push forward research in this area. We provide the results of variations of DPM [6] on our new dataset for object detection and viewpoint estimation in different scenarios, which can be used as baselines for the community. Our benchmark is available online at http://cvgl.stanford.edu/projects/pascal3d.

This publication has 19 references indexed in Scilit:

NYC3DCars: A Dataset of 3D Vehicles in Geographic Context
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Parsing IKEA Objects: Fine Pose Estimation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
The Pascal Visual Object Classes (VOC) Challenge
International Journal of Computer Vision, 2009
From Images to Shape Models for Object Detection
International Journal of Computer Vision, 2009
EPnP: An Accurate O(n) Solution to the PnP Problem
International Journal of Computer Vision, 2009
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories
Computer Vision and Image Understanding, 2007
3D generic object categorization, localization and pose estimation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
Analyzing appearance and contour based methods for object categorization
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Fast and globally convergent pose estimation from video images
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000

Cited by 430 articles