Places: A 10 Million Image Database for Scene Recognition

Top Cited Papers
Open Access
Abstract
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world. Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate representation of scene classification. With its high-coverage and high-diversity of exemplars, the Places Database along with the Places-CNNs offer a novel resource to guide future progress on scene recognition problems.
Funding Information
  • US National Science Foundation (1016862 to A.O., 1524817 to A.T.)
  • Basic Research Office of the Assistant Secretary of Defense for Research and Engineering
  • Office of Naval Research (N00014-16-1-3116 to A.O.)
  • MIT Big Data Initiative at CSAIL
  • Toyota Research Institute / MIT CSAIL Joint Research Center, Google, Xerox and Amazon Awards
  • NVIDIA Corporation
  • Facebook Fellowship

This publication has 30 references indexed in Scilit: