Harvesting Image Databases from the Web

1 January 2007

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Abstract

The objective of this work is to automatically generate a large number of images for a specified object class (for example, penguin). A multi-modal approach employing both text, meta data and visual features is used to gather many, high-quality images from the Web. Candidate images are obtained by a text based Web search querying on the object identifier (the word penguin). The Web pages and the images they contain are downloaded. The task is then to remove irrelevant images and re-rank the remainder. First, the images are re-ranked using a Bayes posterior estimator trained on the text surrounding the image and meta data features (such as the image alternative tag, image title tag, and image filename). No visual information is used at this stage. Second, the top-ranked images are used as (noisy) training data and a SVM visual classifier is learnt to improve the ranking further. The principal novelty is in combining text/meta-data and visual features in order to achieve a completely automatic ranking of the images. Examples are given for a selection of animals (e.g. camels, sharks, penguins), vehicles (cars, airplanes, bikes) and other classes (guitar, wristwatch), totalling 18 classes. The results are assessed by precision/recall curves on ground truth annotated data and by comparison to previous approaches including those of Berg et al. (on an additional six classes) and Fergus et al.

Keywords

This publication has 10 references indexed in Scilit:

OPTIMOL: automatic Online Picture collecTion via Incremental MOdel Learning
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study
International Journal of Computer Vision, 2006
Animals on the Web
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Learning object categories from Google's image search
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Names and faces in the news
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
Web image retrieval re-ranking with relevance model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
Selection of scale-invariant parts for object class recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Contextual Priming for Object Detection
International Journal of Computer Vision, 2003
Models for metasearch
Published by Association for Computing Machinery (ACM) ,2001
Object recognition from local scale-invariant features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999

Cited by 126 articles