WCNN3D: Wavelet Convolutional Neural Network-Based 3D Object Detection for Autonomous Driving

Sensors (Basel). 2022 Sep 16;22(18):7010. doi: 10.3390/s22187010.

Abstract

Three-dimensional object detection is crucial for autonomous driving because it lets the vehicle understand its driving environment. Since the pooling operation causes information loss in a standard CNN, we designed a 3D object detection network based on wavelet multiresolution analysis that uses no pooling operation. Additionally, instead of a single filter as in standard convolution, we used the lower-frequency and higher-frequency coefficients as filters. These filters capture more relevant parts than a single filter and enlarge the receptive field. The model comprises a discrete wavelet transform (DWT) and an inverse wavelet transform (IWT), with skip connections that encourage feature reuse between the contracting and expanding layers. The IWT enriches the feature representation by recovering the details lost during the downsampling operation. Element-wise summation was used for the skip connections to decrease the computational burden. We trained the model with the Haar and Daubechies (Db4) wavelets. The two-level wavelet decomposition result shows that we can build a lightweight model without losing significant performance. The experimental results on KITTI's BEV and 3D evaluation benchmarks show that our model outperforms the PointPillars-based model by up to 14% while reducing the number of trainable parameters.
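The idea of replacing pooling with a DWT and upsampling with an IWT can be illustrated with a minimal NumPy sketch. It is not the authors' network: it only shows a one-level 2D Haar transform on a single feature map, assuming even height and width. The function names haar_dwt2 and haar_iwt2 and the subband labels are hypothetical choices for this illustration, and subband naming conventions differ between libraries.

```python
import numpy as np


def haar_dwt2(x):
    """One-level 2D Haar DWT of an (H, W) array with even H and W.

    Returns four (H/2, W/2) subbands. Unlike pooling, nothing is
    discarded: the four subbands together hold as many values as x.
    """
    a = x[0::2, 0::2]  # top-left sample of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency approximation
    lh = (a - b + c - d) / 2.0  # difference between columns
    hl = (a + b - c - d) / 2.0  # difference between rows
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh


def haar_iwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuilds the (2h, 2w) array from its subbands."""
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=ll.dtype)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 0::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


# Illustrative 8x8 "feature map" (random values, not real LiDAR features).
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8))

ll, lh, hl, hh = haar_dwt2(x)       # downsample by 2 without pooling
recon = haar_iwt2(ll, lh, hl, hh)   # upsample back to the input size

# The transform is orthonormal, so signal energy is preserved.
energy_in = np.sum(x ** 2)
energy_out = sum(np.sum(s ** 2) for s in (ll, lh, hl, hh))

print("subband shape:", ll.shape)
print("max reconstruction error:", np.max(np.abs(recon - x)))
```

In a network of the kind the abstract describes, the four subbands would typically be stacked along the channel axis and passed to convolutions, and a Db4 wavelet would need longer filters and boundary handling that this sketch omits.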

Keywords: 3D object detection; LIDAR data; autonomous driving; deep learning; wavelets.

MeSH terms

  • Algorithms*
  • Neural Networks, Computer*
  • Wavelet Analysis

Grants and funding

This research received no external funding.