In this paper, we propose a new method called the Reinforced Hybrid Genetic Algorithm (RHGA) for solving the famous NP-hard Traveling Salesman Problem (TSP). Specifically, we combine reinforcement learning with the well-known Edge Assembly Crossover genetic algorithm (EAX-GA) and the Lin-Kernighan-Helsgaun (LKH) local search heuristic. In the hybrid algorithm, LKH can help EAX-GA improve the population by its effective local search, and EAX-GA can help LKH escape from local optima by providing high-quality and diverse initial solutions. We restrict that there is only one special individual among the population in EAX-GA that can be improved by LKH. Such a mechanism can prevent the population diversity, efficiency, and algorithm performance from declining due to the redundant calling of LKH upon the population. As a result, our proposed hybrid mechanism can help EAX-GA and LKH boost each other's performance without reducing the convergence rate of the population. The reinforcement learning technique based on Q-learning further promotes the hybrid genetic algorithm. Experimental results on 138 well-known and widely used TSP benchmarks with the number of cities ranging from 1,000 to 85,900 demonstrate the excellent performance of RHGA.
翻译:在本文中,我们提出了一个名为“强化混合遗传算法(RHGA)”的新方法,用于解决著名的NP-hard旅行推销员问题(TSP ) 。 具体地说,我们把强化学习与著名的Edge大会交叉基因算法(EAX-GA)和Lin-Kernigaun(LKH)本地搜索超常基因算法(LKH)相结合。在混合算法中,LKH可以帮助EAX-GA通过有效的当地搜索来改善人口状况,EAX-GA可以帮助LKH摆脱当地奥地马,提供高质量和多样化的初步解决方案。我们限制在EAX-GA中只有一人能够由LKH改进。这样的机制可以防止人口多样性、效率和算法绩效因对人口的需求过剩而下降。因此,我们提议的混合算法机制可以帮助EAX-GA和LKH提高他人的性能,而不会降低人口的趋同率。基于Q学习的强化学习技术,可以进一步促进EAX-GA8-138号的混合遗传算法实验性标准。