DOTA: A Large-Scale Dataset for Object Detection in Aerial Images
Open Access
- 1 June 2018
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 3974-3983
- https://doi.org/10.1109/cvpr.2018.00418
Abstract
Object detection is an important and challenging problem in computer vision. Although the past decade has witnessed major advances in object detection in natural scenes, such successes have been slow to aerial imagery, not only because of the huge variation in the scale, orientation and shape of the object instances on the earth's surface, but also due to the scarcity of well-annotated datasets of objects in aerial scenes. To advance object detection research in Earth Vision, also known as Earth Observation and Remote Sensing, we introduce a large-scale Dataset for Object deTection in Aerial images (DOTA). To this end, we collect 2806 aerial images from different sensors and platforms. Each image is of the size about 4000 × 4000 pixels and contains objects exhibiting a wide variety of scales, orientations, and shapes. These DOTA images are then annotated by experts in aerial image interpretation using 15 common object categories. The fully annotated DOTA images contains 188, 282 instances, each of which is labeled by an arbitrary (8 d.o.f.) quadrilateral. To build a baseline for object detection in Earth Vision, we evaluate state-of-the-art object detection algorithms on DOTA. Experiments demonstrate that DOTA well represents real Earth Vision applications and are quite challenging.Keywords
This publication has 29 references indexed in Scilit:
- Vehicle detection in aerial imagery : A small target detection benchmarkJournal of Visual Communication and Image Representation, 2016
- Orientation robust object detection in aerial images using deep convolutional neural networkPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- ICDAR 2015 competition on Robust ReadingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- Building detection in very high resolution multispectral data with deep learning featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- Fast Multiclass Vehicle Detection on Aerial ImagesIEEE Geoscience and Remote Sensing Letters, 2015
- Going deeper with convolutionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- Rotation-Invariant Object Detection in Remote Sensing Images Based on Radial-Gradient AngleIEEE Geoscience and Remote Sensing Letters, 2014
- Detecting Cars in UAV Images With a Catalog-Based ApproachIEEE Transactions on Geoscience and Remote Sensing, 2014
- A Hierarchical and Contextual Model for Aerial Image ParsingInternational Journal of Computer Vision, 2009
- The Pascal Visual Object Classes (VOC) ChallengeInternational Journal of Computer Vision, 2009