学习插入式构建的神经车辆路径规划求解器 (Learning to Insert for Constructive Neural Vehicle Routing Solver)

Neural Combinatorial Optimisation (NCO) is a promising learning-based approach for solving Vehicle Routing Problems (VRPs) without extensive manual design. While existing constructive NCO methods typically follow an appending-based paradigm that sequentially adds unvisited nodes to partial solutions, this rigid approach often leads to suboptimal results. To overcome this limitation, we explore the idea of insertion-based paradigm and propose Learning to Construct with Insertion-based Paradigm (L2C-Insert), a novel learning-based method for constructive NCO. Unlike traditional approaches, L2C-Insert builds solutions by strategically inserting unvisited nodes at any valid position in the current partial solution, which can significantly enhance the flexibility and solution quality. The proposed framework introduces three key components: a novel model architecture for precise insertion position prediction, an efficient training scheme for model optimization, and an advanced inference technique that fully exploits the insertion paradigm's flexibility. Extensive experiments on both synthetic and real-world instances of the Travelling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) demonstrate that L2C-Insert consistently achieves superior performance across various problem sizes.

翻译：神经组合优化（NCO）是一种基于学习的、无需大量人工设计的车辆路径问题（VRP）求解方法，具有广阔前景。现有的构建式NCO方法通常遵循基于追加的范式，即依次将未访问节点添加到部分解中，但这种僵化的方法往往导致次优结果。为克服这一局限，我们探索了基于插入的范式思想，并提出了一种新颖的基于学习的构建式NCO方法——学习基于插入范式的构建（L2C-Insert）。与传统方法不同，L2C-Insert通过在当前部分解的任何有效位置策略性地插入未访问节点来构建解，这能显著提升解的灵活性和质量。该框架引入了三个关键组件：用于精确预测插入位置的新型模型架构、用于模型优化的高效训练方案，以及充分利用插入范式灵活性的高级推理技术。在旅行商问题（TSP）和容量约束车辆路径问题（CVRP）的合成及真实实例上进行的大量实验表明，L2C-Insert在不同问题规模下均能持续取得优越性能。