Deep Learning-Based Target Tracking and Classification for Low Quality Videos Using Coded Aperture Cameras

Sensors (Basel). 2019 Aug 26;19(17):3702. doi: 10.3390/s19173702.

Abstract

Compressive sensing has seen many applications in recent years. One type of compressive sensing device is the Pixel-wise Code Exposure (PCE) camera, which has low power consumption and individual control of pixel exposure time. In order to use PCE cameras for practical applications, a time consuming and lossy process is needed to reconstruct the original frames. In this paper, we present a deep learning approach that directly performs target tracking and classification in the compressive measurement domain without any frame reconstruction. In particular, we propose to apply You Only Look Once (YOLO) to detect and track targets in the frames and we propose to apply Residual Network (ResNet) for classification. Extensive simulations using low quality optical and mid-wave infrared (MWIR) videos in the SENSIAC database demonstrated the efficacy of our proposed approach.

Keywords: MWIR; ResNet; YOLO; compressive sensing; optical; pixel-wise code exposure camera; target classification; target tracking.