3D Vehicle Detection and Segmentation Based on EfficientNetB3 and CenterNet Residual Blocks

Sensors (Basel). 2022 Oct 20;22(20):7990. doi: 10.3390/s22207990.

Abstract

In this paper, we present a two stages solution to 3D vehicle detection and segmentation. The first stage depends on the combination of EfficientNetB3 architecture with multiparallel residual blocks (inspired by CenterNet architecture) for 3D localization and poses estimation for vehicles on the scene. The second stage takes the output of the first stage as input (cropped car images) to train EfficientNet B3 for the image recognition task. Using predefined 3D Models, we substitute each vehicle on the scene with its match using the rotation matrix and translation vector from the first stage to get the 3D detection bounding boxes and segmentation masks. We trained our models on an open-source dataset (ApolloCar3D). Our method outperforms all published solutions in terms of 6 degrees of freedom error (6 DoF err).

Keywords: 3D object detection; 3D segmentation; autonomous driving; image processing; localization; machine learning; vehicle classification.

MeSH terms

  • Delayed Emergence from Anesthesia*
  • Humans
  • Imaging, Three-Dimensional* / methods
  • Tomography, X-Ray Computed / methods