Receptive Fields Selection for Binary Feature Description

Abstract
Feature description for local image patch is widely used in computer vision. While the conventional way to design local descriptor is based on expert experience and knowledge, learning-based methods for designing local descriptor become more and more popular because of their good performance and data-driven property. This paper proposes a novel data-driven method for designing binary feature descriptor, which we call receptive fields descriptor (RFD). Technically, RFD is constructed by thresholding responses of a set of receptive fields, which are selected from a large number of candidates according to their distinctiveness and correlations in a greedy way. Using two different kinds of receptive fields (namely rectangular pooling area and Gaussian pooling area) for selection, we obtain two binary descriptors RFD R and RFD G accordingly. Image matching experiments on the well-known patch data set and Oxford data set demonstrate that RFD significantly outperforms the state-of-the-art binary descriptors, and is comparable with the best float-valued descriptors at a fraction of processing time. Finally, experiments on object recognition tasks confirm that both RFD R and RFD G successfully bridge the performance gap between binary descriptors and their floating-point competitors.
Funding Information
  • National Natural Science Foundation of China (61203277, 61272394, 91338202)
  • Beijing Natural Science Foundation (4142057)
  • Tsinghua National Laboratory for Information Science and Technology Cross-Discipline Foundation

This publication has 33 references indexed in Scilit: