Beyond PASCAL: A benchmark for 3D object detection in the wild
Top Cited Papers
- 1 March 2014
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15505790,p. 75-82
- https://doi.org/10.1109/wacv.2014.6836101
Abstract
3D object detection and pose estimation methods have become popular in recent years since they can handle ambiguities in 2D images and also provide a richer description for objects compared to 2D object detectors. However, most of the datasets for 3D recognition are limited to a small amount of images per category or are captured in controlled environments. In this paper, we contribute PASCAL3D+ dataset, which is a novel and challenging dataset for 3D object detection and pose estimation. PASCAL3D+ augments 12 rigid categories of the PASCAL VOC 2012 [4] with 3D annotations. Furthermore, more images are added for each category from ImageNet [3]. PASCAL3D+ images exhibit much more variability compared to the existing 3D datasets, and on average there are more than 3,000 object instances per category. We believe this dataset will provide a rich testbed to study 3D detection and pose estimation and will help to significantly push forward research in this area. We provide the results of variations of DPM [6] on our new dataset for object detection and viewpoint estimation in different scenarios, which can be used as baselines for the community. Our benchmark is available online at http://cvgl.stanford.edu/projects/pascal3d.This publication has 19 references indexed in Scilit:
- NYC3DCars: A Dataset of 3D Vehicles in Geographic ContextPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object LabelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Parsing IKEA Objects: Fine Pose EstimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- The Pascal Visual Object Classes (VOC) ChallengeInternational Journal of Computer Vision, 2009
- From Images to Shape Models for Object DetectionInternational Journal of Computer Vision, 2009
- EPnP: An Accurate O(n) Solution to the PnP ProblemInternational Journal of Computer Vision, 2009
- Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categoriesComputer Vision and Image Understanding, 2007
- 3D generic object categorization, localization and pose estimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Analyzing appearance and contour based methods for object categorizationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fast and globally convergent pose estimation from video imagesIEEE Transactions on Pattern Analysis and Machine Intelligence, 2000