A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone

Reinforcement learning has recently shown promise in learning high-quality solutions to many combinatorial optimization problems. In particular, attention-based encoder-decoder models are highly effective on various routing problems, including the Traveling Salesman Problem (TSP). Unfortunately, they perform poorly on the TSP with Drone (TSP-D), which requires routing a heterogeneous fleet of vehicles, a truck and a drone, in coordination. In TSP-D, the two vehicles move in tandem, and one may need to wait at a node for the other to join. A stateless attention-based decoder fails to achieve such coordination between vehicles. We propose a hybrid model that uses an attention encoder and a Long Short-Term Memory (LSTM) network decoder, in which the decoder's hidden state can represent the sequence of actions made. We empirically demonstrate that this hybrid model improves upon a purely attention-based model in both solution quality and computational efficiency. Our experiments on the min-max Capacitated Vehicle Routing Problem (mmCVRP) also confirm that the hybrid model is better suited to the coordinated routing of multiple vehicles than the attention-based model. The proposed model achieves results comparable to those of the operations research baseline methods.
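To make the coordination requirement concrete, the following minimal sketch computes the duration of a single launch-and-rendezvous operation, in which one vehicle may have to wait for the other. It is an illustration under stated assumptions, not the paper's model: the function names, the Euclidean travel times, the single-customer drone sortie, and the default drone speed of twice the truck speed are all hypothetical choices made for this example.

```python
import math


def dist(a, b):
    """Euclidean distance between two 2-D points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def path_time(points, speed):
    """Travel time along consecutive points at a constant speed."""
    return sum(dist(p, q) for p, q in zip(points, points[1:])) / speed


def operation_time(truck_path, drone_customer, truck_speed=1.0, drone_speed=2.0):
    """Duration of one launch-and-rendezvous operation (illustrative only).

    truck_path: points visited by the truck, from the launch node to the
        rendezvous node (inclusive).
    drone_customer: the single point the drone serves in between
        (assumption: one customer per sortie).
    Returns (duration, truck_wait, drone_wait).
    """
    launch, rendezvous = truck_path[0], truck_path[-1]
    t_truck = path_time(truck_path, truck_speed)
    t_drone = path_time([launch, drone_customer, rendezvous], drone_speed)
    # The operation ends only when both vehicles are at the rendezvous node,
    # so the faster vehicle waits for the slower one.
    duration = max(t_truck, t_drone)
    return duration, duration - t_truck, duration - t_drone


# The truck drives (0,0) -> (3,0) -> (3,4); the drone serves (0,4) and rejoins at (3,4).
duration, truck_wait, drone_wait = operation_time([(0, 0), (3, 0), (3, 4)], (0, 4))
print(duration, truck_wait, drone_wait)
```

In this example the truck needs 7 time units and the drone only 3.5, so the drone waits at the rendezvous node. Because the cost of each operation depends on what both vehicles did since the launch, a decoder plausibly benefits from remembering the sequence of earlier actions, which is the role the abstract assigns to the LSTM hidden state.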

