Fine-Grained Image Search
- 4 March 2015
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Multimedia
- Vol. 17 (5), 636-647
- https://doi.org/10.1109/tmm.2015.2408566
Abstract
Large-scale image search has been attracting lots of attention from both academic and commercial fields. The conventional bag-of-visual-words (BoVW) model with inverted index is verified efficient at retrieving near-duplicate images, but it is less capable of discovering fine-grained concepts in the query and returning semantically matched search results. In this paper, we suggest that instance search should return not only near-duplicate images, but also fine-grained results, which is usually the actual intention of a user. We propose a new and interesting problem named fine-grained image search, which means that we prefer those images containing the same fine-grained concept with the query. We formulate the problem by constructing a hierarchical database and defining an evaluation method. We thereafter introduce a baseline system using fine-grained classification scores to represent and co-index images so that the semantic attributes are better incorporated in the online querying stage. Large-scale experiments reveal that promising search results are achieved with reasonable time and memory consumption. We hope this paper will be the foundation for future work on image search. We also expect more follow-up efforts along this research topic and look forward to commercial fine-grained image search engines.Keywords
Funding Information
- National Basic Research Program (973 Program) of China (2013CB329403, 2012CB316301, 2014CB347600)
- National Natural Science Foundation of China (61332007, 61273023, 61429201)
- Tsinghua University Initiative Scientific Research Program (20121088071)
- NEC Laboratories of America Faculty Research Awards (W911NF-12-1-0057)
This publication has 58 references indexed in Scilit:
- Spatial Pooling of Heterogeneous Features for Image ClassificationIEEE Transactions on Image Processing, 2014
- Contextual Hashing for Large-Scale Image SearchIEEE Transactions on Image Processing, 2014
- Learning Query-Specific Distance Functions for Large-Scale Web Image SearchIEEE Transactions on Multimedia, 2013
- Trinary-Projection Trees for Approximate Nearest Neighbor SearchIEEE Transactions on Pattern Analysis and Machine Intelligence, 2013
- Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categoriesComputer Vision and Image Understanding, 2007
- A survey of content-based image retrieval with high-level semanticsPattern Recognition, 2006
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- Scale & Affine Invariant Interest Point DetectorsInternational Journal of Computer Vision, 2004
- Cumulated gain-based evaluation of IR techniquesACM Transactions on Information Systems, 2002
- Visual information retrieval from large distributed online repositoriesCommunications of the ACM, 1997