Recently, several studies have explored the use of neural network to solve different routing problems, which is an auspicious direction. These studies usually design an encoder-decoder based framework that uses encoder embeddings of nodes and the problem-specific context to produce node sequence(path), and further optimize the produced result on top by beam search. However, existing models can only support node coordinates as input, ignore the self-referential property of the studied routing problems, and lack the consideration about the low reliability in the initial stage of node selection, thus are hard to be applied in real-world. In this paper, we take the orienteering problem as an example to tackle these limitations. We propose a novel combination of a variant beam search algorithm and a learned heuristic for solving the general orienteering problem. We acquire the heuristic with an attention network that takes the distances among nodes as input, and learn it via a reinforcement learning framework. The empirical studies show that our method can surpass a wide range of baselines and achieve results close to the optimal or highly specialized approach. Also, our proposed framework can be easily applied to other routing problems. Our code is publicly available.
翻译:最近,一些研究探索了使用神经网络解决不同路由问题,这是一个很好的方向。这些研究通常设计一个基于编码器解码器的框架,使用节点和特定问题背景的编码器嵌入编码器来生成节点序列(路径),并通过光束搜索进一步优化顶部生成的结果。然而,现有的模型只能支持节点坐标作为输入,忽视所研究路由问题的自偏向属性,缺乏对节点选择初始阶段的可靠性的考虑,因此难以在现实世界中应用。在本文中,我们以导向问题为例,解决这些限制。我们提议将变式波束搜索算法和解决一般或定向问题学的超自然理论进行新组合。我们从关注网络中获取将节点之间的距离作为输入,并通过强化学习框架来学习。实验研究表明,我们的方法可以超越广泛的基线,并取得接近最佳或高度专业化方法的成果。此外,我们提议的框架可以很容易被公开应用到其他规则。