Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
- 5 September 2016
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Geoscience and Remote Sensing
- Vol. 54 (12), 7405-7415
- https://doi.org/10.1109/tgrs.2016.2601622
Abstract
Object detection in very high resolution optical remote sensing images is a fundamental problem in remote sensing image analysis. Owing to advances in powerful feature representations, machine-learning-based object detection is receiving increasing attention. Although numerous feature representations exist, most of them are handcrafted or shallow-learning-based features. As the object detection task becomes more challenging, their description capability becomes limited or even impoverished. More recently, deep learning algorithms, especially convolutional neural networks (CNNs), have shown much stronger feature representation power in computer vision. Despite the progress made on natural scene images, it is problematic to directly use CNN features for object detection in optical remote sensing images because they do not effectively handle object rotation variations. To address this problem, this paper proposes a novel and effective approach to learn a rotation-invariant CNN (RICNN) model for advancing the performance of object detection, achieved by introducing and learning a new rotation-invariant layer on top of existing CNN architectures. Unlike the training of traditional CNN models, which only optimizes the multinomial logistic regression objective, the RICNN model is trained by optimizing a new objective function that imposes a regularization constraint, explicitly enforcing the feature representations of the training samples before and after rotation to be mapped close to each other, hence achieving rotation invariance. To facilitate training, the rotation-invariant layer is trained first and then the whole RICNN network is domain-specifically fine-tuned to further boost performance. Comprehensive evaluations on a publicly available ten-class object detection data set demonstrate the effectiveness of the proposed method.
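The rotation-invariance regularization described in the abstract can be illustrated with a short sketch. The snippet below is a minimal, hedged example rather than the authors' released code: it assumes a PyTorch-style setup with a stand-in backbone, a newly added rotation-invariant layer, and a loss that combines cross-entropy with a penalty pulling the features of each sample and its rotated copies toward their mean. The class and function names, the rotation angles, and the weight `lam` are illustrative assumptions, not the paper's exact settings (the paper builds on an existing CNN such as AlexNet and uses a specific set of rotation transformations).

```python
# Minimal sketch (not the authors' code) of a rotation-invariance
# regularizer in the spirit of RICNN. All hyperparameters are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchvision.transforms.functional as TF


class RICNNSketch(nn.Module):
    def __init__(self, num_classes: int = 10, feat_dim: int = 256):
        super().__init__()
        # Stand-in backbone; the paper attaches its new layer to an
        # existing CNN architecture instead.
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, 5, stride=2, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        # Newly introduced rotation-invariant layer.
        self.ri_layer = nn.Linear(64, feat_dim)
        self.classifier = nn.Linear(feat_dim, num_classes)

    def features(self, x: torch.Tensor) -> torch.Tensor:
        return F.relu(self.ri_layer(self.backbone(x)))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x))


def ricnn_loss(model, images, labels, angles=(90.0, 180.0, 270.0), lam=0.01):
    """Cross-entropy loss plus a penalty that pulls the features of each
    sample and its rotated copies toward their mean, encouraging
    rotation-invariant representations."""
    feats = [model.features(images)]
    cls_loss = F.cross_entropy(model.classifier(feats[0]), labels)
    for angle in angles:
        feats.append(model.features(TF.rotate(images, angle)))
    feats = torch.stack(feats)                # (num_rotations + 1, B, feat_dim)
    mean_feat = feats.mean(dim=0, keepdim=True)
    reg = ((feats - mean_feat) ** 2).mean()   # rotation-invariance penalty
    return cls_loss + lam * reg


# Usage sketch:
# model = RICNNSketch()
# loss = ricnn_loss(model, batch_images, batch_labels)
# loss.backward()
```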
Funding Information
- National Natural Science Foundation of China (61401357, 61473231)
- Fundamental Research Funds for the Central Universities (3102016ZY023)
- Innovation Foundation for Doctor Dissertation of NPU (CX201622)
- Aerospace Science Foundation of China (20140153003)