Vehicle routing problem (VRP) is a typical discrete combinatorial optimization problem, and many models and algorithms have been proposed to solve VRP and variants. Although existing approaches has contributed a lot to the development of this field, these approaches either are limited in problem size or need manual intervening in choosing parameters. To tackle these difficulties, many studies consider learning-based optimization algorithms to solve VRP. This paper reviews recent advances in this field and divides relevant approaches into end-to-end approaches and step-by-step approaches. We design three part experiments to justly evaluate performance of four representative learning-based optimization algorithms and conclude that combining heuristic search can effectively improve learning ability and sampled efficiency of LBO models. Finally we point out that research trend of LBO algorithms is to solve large-scale and multiple constraints problems from real world.
翻译:车辆路由问题(VRP)是一个典型的离散组合优化问题,许多模型和算法都提出了解决VRP和变式的方法。虽然现有方法对这一领域的发展作出了很大贡献,但这些方法在问题规模上是有限的,或者需要人工干预来选择参数。为了解决这些困难,许多研究考虑以学习为基础的优化算法来解决VRP。本文回顾了该领域的最新进展,将相关方法分为端到端的方法和逐步的方法。我们设计了三个部分的实验,以公正评估四个有代表性的基于学习的优化算法的绩效,并得出结论,将超自然研究结合起来可以有效地提高学习能力和LBO模型的抽样效率。最后,我们指出,LBO算法的研究趋势是解决现实世界的大规模和多重制约问题。