Traditional solvers for tackling combinatorial optimization (CO) problems are usually designed by human experts. Recently, there has been a surge of interest in utilizing deep learning, especially deep reinforcement learning, to automatically learn effective solvers for CO. The resultant new paradigm is termed neural combinatorial optimization (NCO). However, the advantages and disadvantages of NCO relative to other approaches have not been empirically or theoretically well studied. This work presents a comprehensive comparative study of NCO solvers and alternative solvers. Specifically, taking the traveling salesman problem as the testbed problem, the performance of the solvers is assessed in five aspects, i.e., effectiveness, efficiency, stability, scalability, and generalization ability. Our results show that the solvers learned by NCO approaches, in general, still fall short of traditional solvers in nearly all these aspects. A potential benefit of NCO solvers would be their superior time and energy efficiency for small-size problem instances when sufficient training instances are available. Hopefully, this work would help with a better understanding of the strengths and weaknesses of NCO and provide a comprehensive evaluation protocol for further benchmarking NCO approaches in comparison to other approaches.
翻译:传统求解组合优化问题的方法通常是由人类专家设计的。最近,利用深度学习,特别是深度强化学习,自动学习有效的组合优化求解器的兴趣正在增加。这种新范式被称为神经组合优化(NCO)。然而,相对于其他方法,NCO的优点和缺点尚未得到经验或理论上的充分研究。本文提出了一项全面的NCO求解器和替代求解器的比较研究。具体地,以旅行商问题为测试问题,在五个方面评估求解器的性能,即有效性、效率、稳定性、可扩展性和泛化能力。结果表明,总体而言,NCO方法学习到的求解器在这些方面仍然不如传统求解器。NCO求解器的一个潜在优点是,当有足够的训练实例可用时,它们在小规模问题实例的时间和能源效率方面可能更优。希望这项工作能帮助更好地理解NCO的优劣势,并提供进一步评估NCO方法相对于其他方法的完整评估协议。