Tag localization with spatial correlations and joint group sparsity
- 1 June 2011
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Social images are appearing on the Web in large numbers, and labeling them precisely is critical to image retrieval. Traditional image-level tagging methods may become less effective, however, because global image matching approaches can hardly cope with the diversity and arbitrariness of Web image content. This raises an urgent need for fine-grained tagging schemes. In this work, we study how to establish a mapping between tags and image regions, i.e., how to localize tags to image regions, so as to better depict and index image content. We propose spatial group sparse coding (SGSC), which extends the robust encoding ability of group sparse coding with spatial correlations among training regions. We represent spatial correlations in a two-dimensional image space and design group-specific spatial kernels to produce a more interpretable regularizer. We further propose a joint version of the SGSC model that simultaneously encodes a group of intrinsically related regions within a test image, and we develop an effective algorithm to optimize the objective function of the Joint SGSC. Tag localization is carried out by propagating tags from sparsely selected groups of regions to the target regions according to the reconstruction coefficients. Extensive experiments on three public image datasets show that the proposed models achieve substantial performance improvements over the state-of-the-art method in the tag localization task.
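The propagation idea in the abstract, reconstructing a target region from a few groups of training regions and passing tags along the reconstruction coefficients, can be illustrated with a minimal sketch. This is plain group sparse coding (a group-lasso objective solved by proximal gradient), not the SGSC or Joint SGSC model: it has no spatial kernels and encodes one region at a time. The function names, the synthetic features, the tag labels, and the value of `lam` are all assumptions made for illustration.

```python
import numpy as np

def group_soft_threshold(w, groups, thresh):
    """Block soft-thresholding: shrink each group's coefficients toward zero."""
    out = np.zeros_like(w)
    for idx in groups:
        norm = np.linalg.norm(w[idx])
        if norm > thresh:
            out[idx] = (1.0 - thresh / norm) * w[idx]
    return out

def group_sparse_code(x, D, groups, lam=0.1, n_iter=500):
    """Minimize 0.5*||x - D w||^2 + lam * sum_g ||w_g||_2 by proximal gradient."""
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)  # 1 / Lipschitz constant of the gradient
    w = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ w - x)
        w = group_soft_threshold(w - step * grad, groups, step * lam)
    return w

def propagate_tags(w, groups, tags):
    """Score each tag by the absolute coefficient mass of its group of regions."""
    scores = {tag: float(np.abs(w[idx]).sum()) for tag, idx in zip(tags, groups)}
    return max(scores, key=scores.get), scores

# Synthetic setup (hypothetical): 9 training regions with 20-dim features,
# split into 3 groups, each group carrying one tag.
rng = np.random.default_rng(0)
D = rng.normal(size=(20, 9))
groups = [np.arange(0, 3), np.arange(3, 6), np.arange(6, 9)]
tags = ["sky", "grass", "water"]

# The target region is built only from the second group's regions.
x = D[:, 3:6] @ np.array([0.5, 0.3, 0.2])

w = group_sparse_code(x, D, groups, lam=0.05)
best_tag, scores = propagate_tags(w, groups, tags)
print(best_tag, scores)
```

In this toy setup the target region is constructed from a single group, so that group's tag should receive the highest score. The group penalty is what encourages whole groups of regions, rather than scattered individual regions, to be selected or dropped together.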