Efficient Object Detection Based on Masking Semantic Segmentation Region for Lightweight Embedded Processors

Sensors (Basel). 2022 Nov 17;22(22):8890. doi: 10.3390/s22228890.

Abstract

Because of the development of image processing using cameras and the subsequent development of artificial intelligence technology, various fields have begun to develop. However, it is difficult to implement an image processing algorithm that requires a lot of calculations on a light board. This paper proposes a method using real-time deep learning object recognition algorithms in lightweight embedded boards. We have developed an algorithm suitable for lightweight embedded boards by appropriately using two deep neural network architectures. The first architecture requires small computational volumes, although it provides low accuracy. The second architecture uses large computational volumes and provides high accuracy. The area is determined using the first architecture, which processes semantic segmentation with relatively little computation. After masking the area using the more accurate deep learning architecture, object detection is implemented with improved accuracy, as the image is filtered by segmentation and the cases that have not been recognized by various variables, such as differentiation from the background, are excluded. OpenCV (Open source Computer Vision) is used to process input images in Python, and images are processed using an efficient neural network (ENet) and You Only Look Once (YOLO). By running this algorithm, the average error can be reduced by approximately 2.4 times, allowing for more accurate object detection. In addition, object recognition can be performed in real time for lightweight embedded boards, as a rate of about 4 FPS (frames per second) is achieved.

Keywords: ENet; OpenCV; YOLO; autonomous driving; deep learning; object detection.

MeSH terms

  • Algorithms
  • Artificial Intelligence*
  • Image Processing, Computer-Assisted / methods
  • Neural Networks, Computer
  • Semantics*