MixedFusion: An Efficient Multimodal Data Fusion Framework for 3-D Object Detection and Tracking

IEEE Trans Neural Netw Learn Syst. 2023 Nov 1:PP. doi: 10.1109/TNNLS.2023.3325527. Online ahead of print.

Abstract

The performance of environmental perception is critical for the safe driving of intelligent connected vehicles (ICVs). Currently, the most prevalent technical solutions rely on multimodal data fusion to achieve comprehensive perception of the surrounding environment. However, existing fusion-based perception methods suffer from low sensor data utilization and poorly designed fusion strategies, which severely limit their performance in adverse weather conditions. To address these issues, this article proposes a novel multimodal data fusion framework called MixedFusion. Within this framework, we introduce two innovative fusion strategies tailored to the data characteristics of each sensor: high-level semantic guidance (HLSG) and multipriority matching (MPM). Together, these strategies not only make efficient use of the multimodal data but also achieve complementary fusion across modalities. We perform extensive experiments on the nuScenes and K-Radar datasets, and the results demonstrate that the proposed fusion framework significantly improves 3-D object detection and tracking performance in adverse weather conditions.
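To make the general idea of complementary multimodal feature fusion concrete, the following is a minimal, hypothetical sketch in PyTorch. It is not the paper's HLSG or MPM strategy; it only shows the simplest form of mid-level fusion, concatenating aligned camera and LiDAR bird's-eye-view (BEV) feature maps and mixing them with a 1x1 convolution. The module name, channel sizes, and grid resolution are illustrative assumptions.

import torch
import torch.nn as nn

class NaiveBEVFusion(nn.Module):
    """Toy mid-level fusion: concatenate camera and LiDAR BEV feature maps
    and mix them with a 1x1 convolution. This is NOT the paper's HLSG/MPM;
    it only illustrates the generic idea of complementary feature fusion."""
    def __init__(self, cam_channels=64, lidar_channels=64, out_channels=128):
        super().__init__()
        self.mix = nn.Conv2d(cam_channels + lidar_channels, out_channels, kernel_size=1)

    def forward(self, cam_bev, lidar_bev):
        # Both inputs are assumed to be spatially aligned BEV grids of shape (B, C, H, W).
        fused = torch.cat([cam_bev, lidar_bev], dim=1)
        return self.mix(fused)

# Usage with dummy tensors (batch of 2, 128x128 BEV grid)
cam = torch.randn(2, 64, 128, 128)
lidar = torch.randn(2, 64, 128, 128)
out = NaiveBEVFusion()(cam, lidar)
print(out.shape)  # torch.Size([2, 128, 128, 128])

In contrast to such naive concatenation, the paper's strategies are described as guiding fusion with high-level semantics and matching modalities by priority, which is where the claimed robustness in adverse weather comes from.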