Efficient Object Detection Based on Masking Semantic Segmentation Region for Lightweight Embedded Processors

Heuijee Yun; Daejin Park

doi:10.3390/s22228890

Efficient Object Detection Based on Masking Semantic Segmentation Region for Lightweight Embedded Processors

Sensors (Basel). 2022 Nov 17;22(22):8890. doi: 10.3390/s22228890.

Authors

Heuijee Yun¹, Daejin Park¹

Affiliation

¹ School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Republic of Korea.

Abstract

Because of the development of image processing using cameras and the subsequent development of artificial intelligence technology, various fields have begun to develop. However, it is difficult to implement an image processing algorithm that requires a lot of calculations on a light board. This paper proposes a method using real-time deep learning object recognition algorithms in lightweight embedded boards. We have developed an algorithm suitable for lightweight embedded boards by appropriately using two deep neural network architectures. The first architecture requires small computational volumes, although it provides low accuracy. The second architecture uses large computational volumes and provides high accuracy. The area is determined using the first architecture, which processes semantic segmentation with relatively little computation. After masking the area using the more accurate deep learning architecture, object detection is implemented with improved accuracy, as the image is filtered by segmentation and the cases that have not been recognized by various variables, such as differentiation from the background, are excluded. OpenCV (Open source Computer Vision) is used to process input images in Python, and images are processed using an efficient neural network (ENet) and You Only Look Once (YOLO). By running this algorithm, the average error can be reduced by approximately 2.4 times, allowing for more accurate object detection. In addition, object recognition can be performed in real time for lightweight embedded boards, as a rate of about 4 FPS (frames per second) is achieved.

Keywords: ENet; OpenCV; YOLO; autonomous driving; deep learning; object detection.

MeSH terms

Algorithms
Artificial Intelligence*
Image Processing, Computer-Assisted / methods
Neural Networks, Computer
Semantics*

Abstract

MeSH terms

Grants and funding