World-wide scale geotagged image dataset for automatic image annotation and reverse geotagging

Abstract
International audienceIn this paper, a dataset of geotagged photos on a world-wide scale is presented. The dataset contains a sample of more than 14 Million geotagged photos crawled from Flickr with the corresponding metadata. To guarantee the spatial representativeness of the dataset, a crawling approach based on the small-world phenomena and the Flickr friendship's graph is applied. Furthermore, the noisiness of user-provided tags is reduced through an automatic tag cleaning approach. To enable efficient retrieval, photos in the dataset are indexed based on their location information using quad-tree data structure. The dataset can assists different applications, especially, search-based automatic image annotation and reverse geotagging

This publication has 17 references indexed in Scilit: