Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing

Kyoungtaek Choi; Seong Min Wi; Ho Gi Jung; Jae Kyu Suhr

doi:10.3390/s23073777

Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing

Sensors (Basel). 2023 Apr 6;23(7):3777. doi: 10.3390/s23073777.

Authors

Kyoungtaek Choi¹, Seong Min Wi², Ho Gi Jung³, Jae Kyu Suhr⁴

Affiliations

¹ Department of AI Automation Robot, Daegu Catholic University, 13-13 Hayang-ro, Hayang-eup, Gyeongsan-si 38430, Gyeongsangbuk-do, Republic of Korea.
² Driving Image Recognition Logic Cell, Hyundai Mobis, 17-2 Mabuk-ro 240beon-gil, Giheung-gu, Yongin-si 16891, Gyeonggi-do, Republic of Korea.
³ Department of Electronic Engineering, Korea National University of Transportation, 50 Daehak-ro, Chungju-si 27469, Chungbuk-do, Republic of Korea.
⁴ Department of Intelligent Mechatronics Engineering, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Republic of Korea.

Abstract

This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken regarding the number of channels when summing two feature maps. Based on the comparison in terms of detection performance, parameter number, computational complexity, and processing time, this paper discovers the most satisfying method on the edge device. For network quantization, this paper compares post-training quantization (PTQ) and quantization-aware training (QAT) using two datasets with different detection difficulties. This comparison shows that both approaches are recommended in the case of the easy-to-detect dataset, but QAT is preferable in the case of the difficult-to-detect dataset. Through experiments, this paper shows that the proposed method can effectively embed the DNN-based object detector into an edge device equipped with Qualcomm's QCS605 System-on-Chip (SoC), while achieving a real-time operation with more than 10 frames per second.

Keywords: channel pruning; edge computing; network simplification; object detector.

Abstract

Grants and funding