A 3D U-Net Based on a Vision Transformer for Radar Semantic Segmentation

Sensors (Basel). 2023 Dec 5;23(24):9630. doi: 10.3390/s23249630.

Abstract

Unlike optical image data, radar data can be presented in many forms. In radar target recognition, most current work uses point cloud data because of computational limits, but point clouds discard useful information. This paper proposes a semantic segmentation network that processes high-dimensional radar data to enable automatic target recognition. Instead of the point clouds common in current radar automatic target recognition algorithms, it uses radar heat maps of high-dimensional data, making fuller use of the radar signal. A heat map retains more complete information than a point cloud, which yields more accurate classification results. The paper also proposes a dimension collapse module, based on a vision transformer, that extracts features between two modules of differing dimensionality as the high-dimensional data are collapsed; the module extends easily to other networks that must collapse high-dimensional data. The network's performance is verified on a real radar dataset, showing that the vision-transformer-based radar semantic segmentation network achieves better performance with fewer parameters than segmentation networks using other dimensional collapse methods.
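The abstract does not specify how the vision-transformer-based dimension collapse is implemented. As a purely illustrative sketch, one plausible reading is attention pooling: treat the bins along the axis to be collapsed (e.g. Doppler) as tokens and reduce them with a learned attention query. All names and shapes below (`attention_collapse`, the `(D, H, W, C)` layout, the query `q` and projections `wk`, `wv`) are hypothetical, not taken from the paper.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention_collapse(x, q, wk, wv):
    """Collapse the leading axis of a feature volume by attention pooling.

    x  : (D, H, W, C) feature volume, e.g. (Doppler, range, azimuth, channels)
    q  : (C,)   learned query vector (hypothetical parameter)
    wk : (C, C) key projection
    wv : (C, C) value projection
    Returns a (H, W, C) tensor: the D axis is collapsed per spatial location.
    """
    D, H, W, C = x.shape
    k = x @ wk                                           # keys   (D, H, W, C)
    v = x @ wv                                           # values (D, H, W, C)
    scores = np.einsum('dhwc,c->dhw', k, q) / np.sqrt(C)  # (D, H, W)
    attn = softmax(scores, axis=0)                       # weights over the D axis
    return np.einsum('dhw,dhwc->hwc', attn, v)           # (H, W, C)

rng = np.random.default_rng(0)
x  = rng.standard_normal((8, 16, 16, 32))  # (Doppler, range, azimuth, channels)
q  = rng.standard_normal(32)
wk = rng.standard_normal((32, 32)) / np.sqrt(32)
wv = rng.standard_normal((32, 32)) / np.sqrt(32)
print(attention_collapse(x, q, wk, wv).shape)  # (16, 16, 32)
```

A convolution or flat average over the same axis would also collapse the dimension; the attention variant lets each spatial location weight its bins differently, which is the kind of data-dependent reduction a transformer-based module would provide.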

Keywords: 3D U-Net; data cube; radar semantic segmentation; transformer.