3D Vehicle Detection and Segmentation Based on EfficientNetB3 and CenterNet Residual Blocks

Alexey Kashevnik; Ammar Ali

doi:10.3390/s22207990

3D Vehicle Detection and Segmentation Based on EfficientNetB3 and CenterNet Residual Blocks

Sensors (Basel). 2022 Oct 20;22(20):7990. doi: 10.3390/s22207990.

Authors

Alexey Kashevnik¹, Ammar Ali²

Affiliations

¹ St. Petersburg Federal Research Center of the Russian Academy of Sciences, SPC RAS, 199178 St. Petersburg, Russia.
² Information Technology and Programming Faculty, ITMO University, 197101 St. Petersburg, Russia.

Abstract

In this paper, we present a two stages solution to 3D vehicle detection and segmentation. The first stage depends on the combination of EfficientNetB3 architecture with multiparallel residual blocks (inspired by CenterNet architecture) for 3D localization and poses estimation for vehicles on the scene. The second stage takes the output of the first stage as input (cropped car images) to train EfficientNet B3 for the image recognition task. Using predefined 3D Models, we substitute each vehicle on the scene with its match using the rotation matrix and translation vector from the first stage to get the 3D detection bounding boxes and segmentation masks. We trained our models on an open-source dataset (ApolloCar3D). Our method outperforms all published solutions in terms of 6 degrees of freedom error (6 DoF err).

Keywords: 3D object detection; 3D segmentation; autonomous driving; image processing; localization; machine learning; vehicle classification.

MeSH terms

Delayed Emergence from Anesthesia*
Humans
Imaging, Three-Dimensional* / methods
Tomography, X-Ray Computed / methods

Abstract

MeSH terms

Grants and funding