3D Traffic Scene Understanding From Movable Platforms
Top Cited Papers
- 27 September 2013
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence
- Vol. 36 (5), 1012-1025
- https://doi.org/10.1109/tpami.2013.185
Abstract
In this paper, we present a novel probabilistic generative model for multi-object traffic scene understanding from movable platforms which reasons jointly about the 3D scene layout as well as the location and orientation of objects in the scene. In particular, the scene topology, geometry, and traffic activities are inferred from short video sequences. Inspired by the impressive driving capabilities of humans, our model does not rely on GPS, lidar, or map knowledge. Instead, it takes advantage of a diverse set of visual cues in the form of vehicle tracklets, vanishing points, semantic scene labels, scene flow, and occupancy grids. For each of these cues, we propose likelihood functions that are integrated into a probabilistic generative model. We learn all model parameters from training data using contrastive divergence. Experiments conducted on videos of 113 representative intersections show that our approach successfully infers the correct layout in a variety of very challenging scenarios. To evaluate the importance of each feature cue, experiments using different feature combinations are conducted. Furthermore, we show how by employing context derived from the proposed method we are able to improve over the state-of-the-art in terms of object detection and object orientation estimation in challenging and cluttered urban environments.This publication has 52 references indexed in Scilit:
- Monocular Visual Scene Understanding: Understanding Multi-Object Traffic ScenesIEEE Transactions on Pattern Analysis and Machine Intelligence, 2012
- Are we ready for autonomous driving? The KITTI vision benchmark suitePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- A generative model for 3D urban scene understanding from movable platformsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- StereoScan: Dense 3d reconstruction in real-timePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Who are you with and where are you going?Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Multiple-Target Tracking by Spatiotemporal Monte Carlo Markov Chain Data AssociationIEEE Transactions on Pattern Analysis and Machine Intelligence, 2008
- Putting Objects in PerspectiveInternational Journal of Computer Vision, 2008
- Recovering Surface Layout from an ImageInternational Journal of Computer Vision, 2007
- Model-based recognition of intersections and lane structuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Training Products of Experts by Minimizing Contrastive DivergenceNeural Computation, 2002