One-Shot Fine-Grained Instance Retrieval
- 19 October 2017
- conference paper
- conference paper
- Published by Association for Computing Machinery (ACM) in Proceedings of the 25th ACM international conference on Multimedia
Abstract
Fine-Grained Visual Categorization (FGVC) has achieved significant progress recently. However, the number of fine-grained species could be huge and dynamically increasing in real scenarios, making it difficult to recognize unseen objects under the current FGVC framework. This raises an open issue to perform large-scale fine-grained identification without a complete training set. Aiming to conquer this issue, we propose a retrieval task named One-Shot Fine-Grained Instance Retrieval (OSFGIR). "One-Shot" denotes the ability of identifying unseen objects through a fine-grained retrieval task assisted with an incomplete auxiliary training set. This paper first presents the detailed description to OSFGIR task and our collected OSFGIR-378K dataset. Next, we propose the Convolutional and Normalization Networks (CN-Nets) learned on the auxiliary dataset to generate a concise and discriminative representation. Finally, we present a coarse-to-fine retrieval framework consisting of three components, i.e., coarse retrieval, fine-grained retrieval, and query expansion, respectively. The framework progressively retrieves images with similar semantics, and performs fine-grained identification. Experiments show our OSFGIR framework achieves significantly better accuracy and efficiency than existing FGVC and image retrieval methods, thus could be a better solution for large-scale fine-grained object identification.Keywords
Funding Information
- National Natural Science Foundation of China (61525206, 61572050, 91538111, 61620106009, 61429201)
This publication has 32 references indexed in Scilit:
- Fine-Grained Image SearchIEEE Transactions on Multimedia, 2015
- Local Alignments for Fine-Grained CategorizationInternational Journal of Computer Vision, 2014
- Cascade Category-Aware Visual SearchIEEE Transactions on Image Processing, 2014
- Improved Bird Species Recognition Using Pose Normalized Deep Convolutional NetsPublished by British Machine Vision Association and Society for Pattern Recognition ,2014
- Fine-Grained Categorization by AlignmentsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- 3D Object Representations for Fine-Grained CategorizationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- POOF: Part-Based One-vs.-One Features for Fine-Grained Categorization, Face Verification, and Attribute EstimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Hamming Embedding and Weak Geometric Consistency for Large Scale Image SearchLecture Notes in Computer Science, 2008
- Object retrieval with large vocabularies and fast spatial matchingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004