Files
Abstract
Object detection in high-resolution aerial images is a challenging task because of1) the large variation in object size, and 2) non-uniform distribution of objects. A
common solution is to divide the large aerial image into small (uniform) crops and
then apply object detection on each small crop. In this paper, we investigate the image
cropping strategy to address these challenges. Specifically, we propose a Density-Map
guided object detection Network (DMNet), which is inspired from the observation
that the object density map of an image presents how objects distribute in terms of
the pixel intensity of the map. As pixel intensity varies, it is able to tell whether
a region has objects or not, which in turn provides guidance for cropping images
statistically. DMNet has three key components: a density map generation module,
an image cropping module and an object detector. DMNet generates a density map
and learns scale information based on density intensities to form cropping regions.
Extensive experiments show that DMNet achieves state-of-the-art performance on
two popular aerial image datasets, i.e VisionDrone and UAVDT.