Harvesting Image Databases from the Web
- 1 January 2007
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The objective of this work is to automatically generate a large number of images for a specified object class (for example, penguin). A multi-modal approach employing both text, meta data and visual features is used to gather many, high-quality images from the Web. Candidate images are obtained by a text based Web search querying on the object identifier (the word penguin). The Web pages and the images they contain are downloaded. The task is then to remove irrelevant images and re-rank the remainder. First, the images are re-ranked using a Bayes posterior estimator trained on the text surrounding the image and meta data features (such as the image alternative tag, image title tag, and image filename). No visual information is used at this stage. Second, the top-ranked images are used as (noisy) training data and a SVM visual classifier is learnt to improve the ranking further. The principal novelty is in combining text/meta-data and visual features in order to achieve a completely automatic ranking of the images. Examples are given for a selection of animals (e.g. camels, sharks, penguins), vehicles (cars, airplanes, bikes) and other classes (guitar, wristwatch), totalling 18 classes. The results are assessed by precision/recall curves on ground truth annotated data and by comparison to previous approaches including those of Berg et al. (on an additional six classes) and Fergus et al.Keywords
This publication has 10 references indexed in Scilit:
- OPTIMOL: automatic Online Picture collecTion via Incremental MOdel LearningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive StudyInternational Journal of Computer Vision, 2006
- Animals on the WebPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Learning object categories from Google's image searchPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Names and faces in the newsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Web image retrieval re-ranking with relevance modelPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Selection of scale-invariant parts for object class recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Contextual Priming for Object DetectionInternational Journal of Computer Vision, 2003
- Models for metasearchPublished by Association for Computing Machinery (ACM) ,2001
- Object recognition from local scale-invariant featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999