Deformable part models are convolutional neural networks
Top Cited Papers
- 1 June 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Deformable part models (DPMs) and convolutional neural networks (CNNs) are two widely used tools for visual recognition. They are typically viewed as distinct approaches: DPMs are graphical models (Markov random fields), while CNNs are “black-box” non-linear classifiers. In this paper, we show that a DPM can be formulated as a CNN, thus providing a synthesis of the two ideas. Our construction involves unrolling the DPM inference algorithm and mapping each step to an equivalent CNN layer. From this perspective, it is natural to replace the standard image features used in DPMs with a learned feature extractor. We call the resulting model a DeepPyramid DPM and experimentally validate it on PASCAL VOC object detection. We find that DeepPyramid DPMs significantly outperform DPMs based on histograms of oriented gradients features (HOG) and slightly outperforms a comparable version of the recently introduced R-CNN detection system, while running significantly faster.Keywords
Other Versions
This publication has 18 references indexed in Scilit:
- CaffePublished by Association for Computing Machinery (ACM) ,2014
- Bottom-Up Segmentation for Top-Down DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Sketch Tokens: A Learned Mid-level Representation for Contour and Object DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Selective Search for Object RecognitionInternational Journal of Computer Vision, 2013
- The Pascal Visual Object Classes (VOC) ChallengeInternational Journal of Computer Vision, 2009
- Histograms of Oriented Gradients for Human DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Pictorial Structures for Object RecognitionInternational Journal of Computer Vision, 2005
- Original approach for the localisation of objects in imagesIEE Proceedings - Vision, Image, and Signal Processing, 1994
- Backpropagation Applied to Handwritten Zip Code RecognitionNeural Computation, 1989
- Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in positionBiological Cybernetics, 1980