Empirical Comparison of Automatic Image Annotation Systems

Abstract
The performance of content-based image retrieval systems has proved to be inherently constrained by the low-level features used, and such systems cannot give satisfactory results when the user's high-level concepts cannot be expressed by low-level features. In an attempt to bridge this semantic gap, recent approaches have started integrating both low-level visual features and high-level textual keywords. Unfortunately, manual image annotation is a tedious process and may not be feasible for large image databases. To overcome this limitation, several approaches that can annotate images in a semi-supervised or unsupervised way have emerged. In this paper, we outline and compare four different algorithms. The first one is simple and assumes that image annotation can be viewed as the task of translating from a vocabulary of fixed image regions to a vocabulary of words. The second approach uses a set of annotated images as a training set and learns the joint distribution of regions and words. The third and fourth approaches are based on segmenting the images into homogeneous regions. Both of these approaches rely on a clustering algorithm to learn the association between visual features and keywords. The clustering task is not trivial, as it involves clustering a very high-dimensional and sparse feature space. To address this, the third approach uses semi-supervised constrained clustering, while the fourth relies on an algorithm that performs simultaneous clustering and feature discrimination. These four algorithms were implemented and tested on a data set of 6000 images using four-fold cross-validation.
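
To make the first, translation-style approach concrete, the following is a minimal sketch, not the implementation evaluated in the paper: it assumes each image's segmented regions have already been quantized into discrete cluster indices ("blobs"), learns blob-to-keyword co-occurrence statistics from annotated training images, and annotates a new image by accumulating those statistics over its blobs. All function names and the toy data are hypothetical.

```python
from collections import defaultdict

def train_cooccurrence(training_images):
    """Estimate P(word | blob) from blob/keyword co-occurrence counts.

    training_images: iterable of (blob_ids, keywords) pairs, where blob_ids
    lists the cluster indices of an image's regions and keywords lists its
    annotation words. Region segmentation and clustering are assumed done.
    """
    counts = defaultdict(lambda: defaultdict(float))
    for blob_ids, keywords in training_images:
        for b in blob_ids:
            for w in keywords:
                counts[b][w] += 1.0
    # Normalize each blob's counts into a conditional word distribution.
    model = {}
    for b, word_counts in counts.items():
        total = sum(word_counts.values())
        model[b] = {w: c / total for w, c in word_counts.items()}
    return model

def annotate(model, blob_ids, top_k=5):
    """Score keywords for a new image by summing P(word | blob) over its blobs."""
    scores = defaultdict(float)
    for b in blob_ids:
        for w, p in model.get(b, {}).items():
            scores[w] += p
    return sorted(scores, key=scores.get, reverse=True)[:top_k]

# Toy usage: two annotated training images, then annotation of an unseen one.
train = [([3, 7], ["sky", "grass"]), ([7, 12], ["grass", "cow"])]
model = train_cooccurrence(train)
print(annotate(model, [7, 12]))  # e.g. ['grass', 'cow', 'sky']
```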
