YOLOv5 with ConvMixer Prediction Heads for Precise Object Detection in Drone Imagery

Sensors (Basel). 2022 Nov 2;22(21):8424. doi: 10.3390/s22218424.

Abstract

Unmanned Aerial Vehicles (UAVs) offer unprecedented potential for object detection because of their mobility. This potential has stimulated the use of UAVs with object detection functionality in numerous crucial real-life applications, and more efficient and accurate object detection techniques are being researched and developed for UAV use. However, object detection in UAV imagery presents challenges that are uncommon in general object detection. First, because UAVs fly at varying altitudes, the objects they image vary vastly in size, which makes detection more difficult. Second, because of the motion of the UAVs, the captured images may be blurred. To deal with these challenges, we present a You Only Look Once v5 (YOLOv5)-like architecture with ConvMixers in its prediction heads and an additional prediction head to deal with very small objects. The proposed architecture has been trained and tested on the VisDrone 2021 dataset, and the results are comparable with existing state-of-the-art methods.
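As a rough illustration of why an additional prediction head can help with very small objects, the sketch below compares the detection grids of the three standard YOLOv5 heads (strides 8, 16 and 32) with a hypothetical fourth head. The abstract does not state the stride of the extra head; stride 4 is an assumption made here for illustration, and the function names, the 640-pixel input and the 12-pixel object are likewise illustrative only.

```python
# Standard YOLOv5 predicts on feature maps downsampled by 8, 16 and 32.
BASE_STRIDES = (8, 16, 32)

# Assumption (not stated in the abstract): the additional head for very
# small objects operates on a finer feature map, taken here as stride 4.
EXTRA_STRIDE = 4


def head_grids(image_size, strides):
    """Return {stride: grid cells per side} for a square input image."""
    return {s: image_size // s for s in strides}


def cells_spanned(object_px, stride):
    """Return how many grid cells an object of object_px pixels spans."""
    return object_px / stride


if __name__ == "__main__":
    strides = (EXTRA_STRIDE,) + BASE_STRIDES
    grids = head_grids(640, strides)
    for stride in strides:
        span = cells_spanned(12, stride)
        print(f"stride {stride:>2}: {grids[stride]}x{grids[stride]} grid, "
              f"a 12 px object spans {span:.2f} cells")
```

Under these assumptions, a 12-pixel object covers three cells on the stride-4 grid but less than half a cell on the stride-32 grid, which is one plausible reason a finer head suits objects imaged from high altitude.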

Keywords: ConvMixer; UAV imagery; YOLOv5; object detection.

MeSH terms

  • Altitude
  • Data Collection
  • Remote Sensing Technology* / methods
  • Unmanned Aerial Devices*