Locality-constrained and spatially regularized coding for scene categorization
- 1 June 2012
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 3618-3625
- https://doi.org/10.1109/cvpr.2012.6248107
Abstract
Improving coding and spatial pooling for bag-of-words based feature design have gained a lot of attention in recent works addressing object recognition and scene classification. Regarding the coding step in particular, properties such as sparsity, locality and saliency have been investigated. The main contribution of this work consists in taking into acount the local spatial context of an image into the usual coding strategies proposed in the state-of-the-art. For this purpose, given an imgae, dense local features are extracted and structured in a lattice. The latter is endowed with a neighborhood system and pairwise interactions. We propose a new objective function to encode local features, which preserves locality constraints both in the feature space and the spatial domain of the image. In addition, an appropriate efficient optimization algorithm is provided, inspired from the graph-cut framework. In conjunction with the maximum-pooling operation and the spatial pyramid matching, that reflects a global spatial layout, the proposed method improves the performances of several state-of-the-art coding schemes for scene classification on three publicly available benchmarks (UIUC 8-sport, Scene-15 and Caltech-101).Keywords
This publication has 22 references indexed in Scilit:
- Salient coding for image classificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Local features are not lonely – Laplacian sparse coding for image classificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- SENSC: a Stable and Efficient Algorithm for Nonnegative Sparse CodingActa Automatica Sinica, 2009
- Discriminative learned dictionaries for local image analysisPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Supervised Learning of Quantizer Codebooks by Information Loss MinimizationIeee Transactions On Pattern Analysis and Machine Intelligence, 2008
- What, where and who? Classifying events by scene and object recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene CategoriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- What energy functions can be minimized via graph cuts?Ieee Transactions On Pattern Analysis and Machine Intelligence, 2004
- Sparse coding with an overcomplete basis set: A strategy employed by V1?Vision Research, 1997