Conditional Neural Heuristic for Multiobjective Vehicle Routing Problems

Mingfeng Fan; Yaoxin Wu; Zhiguang Cao; Wen Song; Guillaume Sartoretti; Huan Liu; Guohua Wu

doi:10.1109/TNNLS.2024.3371706

IEEE Trans Neural Netw Learn Syst. 2024 Mar 22:PP. doi: 10.1109/TNNLS.2024.3371706. Online ahead of print.

PMID: 38517723

Abstract

Existing neural heuristics for multiobjective vehicle routing problems (MOVRPs) are conditioned primarily on instance context and fail to adequately exploit preference and problem size, which limits their performance. To address this, we propose a novel conditional neural heuristic (CNH) that fully leverages instance context, preference, and size with an encoder-decoder structured policy network. Specifically, we design a dual-attention-based encoder that relates preferences to instance contexts, so as to better capture their joint effect on approximating the exact Pareto front (PF). We also design a size-aware decoder based on sinusoidal encoding that explicitly incorporates the problem size into the embedding, so that a single trained model can better solve instances of various scales. In addition, we customize the REINFORCE algorithm to train the neural heuristic with stochastic preferences (SPs), which further enhances training performance. Extensive experiments on random and benchmark instances show that CNH achieves a favorable approximation to the whole PF, with higher hypervolume (HV) and lower optimality gap (Gap) than existing neural and conventional heuristics. More importantly, a single trained CNH model can outperform other neural heuristics that are trained exclusively for each size. The effectiveness of the key designs is also verified through ablation studies.
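Two ingredients named in the abstract, a sinusoidal encoding of the problem size and the hypervolume (HV) indicator, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the Transformer-style base of 10000, the embedding dimension, the restriction to two minimized objectives, and the choice of reference point are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def sinusoidal_size_encoding(size, dim):
    """Encode a scalar problem size as a dim-length sinusoidal vector.

    Assumption: the standard Transformer positional-encoding formula with
    base 10000; the exact variant used in CNH is not given in the abstract.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    enc = []
    for i in range(dim // 2):
        freq = 1.0 / (10000 ** (2 * i / dim))
        enc.append(math.sin(size * freq))  # even index: sine component
        enc.append(math.cos(size * freq))  # odd index: cosine component
    return enc


def hypervolume_2d(points, ref):
    """Hypervolume of a bi-objective minimization front w.r.t. point ref.

    Assumption: both objectives are minimized and ref is worse than every
    point of interest in both objectives.
    """
    # Keep only points that strictly dominate the reference point,
    # sorted by the first objective in ascending order.
    pts = sorted(p for p in points if p[0] < ref[0] and p[1] < ref[1])
    hv = 0.0
    best_f2 = ref[1]
    for f1, f2 in pts:
        if f2 < best_f2:  # nondominated so far: adds a new horizontal slab
            hv += (ref[0] - f1) * (best_f2 - f2)
            best_f2 = f2
    return hv


if __name__ == "__main__":
    # Hypothetical sizes and objective values, purely for illustration.
    print(sinusoidal_size_encoding(50, 8))
    print(hypervolume_2d([(1, 3), (2, 2), (3, 1)], ref=(4, 4)))
```

In a size-aware decoder of the kind described, a vector such as the one returned by `sinusoidal_size_encoding` would be combined with the decoder's context embedding; how CNH does this is not stated in the abstract. A larger HV indicates a front that dominates more of the objective space bounded by the reference point.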