Deformable part models are convolutional neural networks

Top Cited Papers

1 June 2015

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 437-446
https://doi.org/10.1109/cvpr.2015.7298641

Abstract

Deformable part models (DPMs) and convolutional neural networks (CNNs) are two widely used tools for visual recognition. They are typically viewed as distinct approaches: DPMs are graphical models (Markov random fields), while CNNs are “black-box” non-linear classifiers. In this paper, we show that a DPM can be formulated as a CNN, thus providing a synthesis of the two ideas. Our construction involves unrolling the DPM inference algorithm and mapping each step to an equivalent CNN layer. From this perspective, it is natural to replace the standard image features used in DPMs with a learned feature extractor. We call the resulting model a DeepPyramid DPM and experimentally validate it on PASCAL VOC object detection. We find that DeepPyramid DPMs significantly outperform DPMs based on histograms of oriented gradients features (HOG) and slightly outperforms a comparable version of the recently introduced R-CNN detection system, while running significantly faster.

Keywords

Other Versions

Version 2, 2014-09-18, preprints

This publication has 18 references indexed in Scilit:

Caffe
Published by Association for Computing Machinery (ACM) ,2014
Bottom-Up Segmentation for Top-Down Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Selective Search for Object Recognition
International Journal of Computer Vision, 2013
The Pascal Visual Object Classes (VOC) Challenge
International Journal of Computer Vision, 2009
Histograms of Oriented Gradients for Human Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Pictorial Structures for Object Recognition
International Journal of Computer Vision, 2005
Original approach for the localisation of objects in images
IEE Proceedings - Vision, Image, and Signal Processing, 1994
Backpropagation Applied to Handwritten Zip Code Recognition
Neural Computation, 1989
Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position
Biological Cybernetics, 1980

Cited by 225 articles