Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning

Recently, there has been an emerging trend of applying deep reinforcement learning to the vehicle routing problem (VRP), where a learnt policy governs the selection of the next node to visit. However, existing methods do not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), a representative variant of VRP. To address this issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep reinforcement learning to automatically select the nodes. In particular, the heterogeneous attention mechanism prescribes attentions for each role of the nodes while taking into account the precedence constraint, i.e., the pickup node must precede its paired delivery node. Further integrated with a masking scheme, the learnt policy is expected to find higher-quality solutions for PDP. Extensive experimental results show that our method outperforms both the state-of-the-art heuristic and the state-of-the-art deep learning model, and generalizes well to different distributions and problem sizes.
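The precedence constraint and masking scheme mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function name `feasible_mask`, the node indexing (node 0 is the depot, nodes 1..n_pairs are pickups, node i + n_pairs is the delivery paired with pickup i), and the rule that the depot becomes selectable only after all other nodes are visited are assumptions made for illustration. Vehicle capacity is ignored.

```python
def feasible_mask(n_pairs, visited):
    """Return a list of booleans; True means the node may be selected next.

    Assumed (hypothetical) indexing: node 0 is the depot, nodes 1..n_pairs
    are pickups, and node i + n_pairs is the delivery paired with pickup i.
    """
    total = 2 * n_pairs + 1
    mask = [False] * total
    for node in range(1, total):
        if node in visited:
            continue  # never revisit a node
        if node <= n_pairs:
            mask[node] = True  # an unvisited pickup is always feasible
        else:
            # a delivery is feasible only after its paired pickup was visited
            mask[node] = (node - n_pairs) in visited
    # assumption: return to the depot only once every other node is visited
    mask[0] = all(node in visited for node in range(1, total))
    return mask


# Two pickup-delivery pairs; pickup 1 has been visited so far.
print(feasible_mask(2, visited={1}))
```

In a learnt policy, a mask of this kind would typically be applied to the attention scores before the softmax, so that infeasible nodes receive zero selection probability.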

Related content

Attention mechanism

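The attention mechanism was first proposed in the field of visual imaging, but it arguably took off with the Google DeepMind paper "Recurrent Models of Visual Attention" [14], which applied attention to an RNN model for image classification. Bahdanau et al. then used an attention-like mechanism in "Neural Machine Translation by Jointly Learning to Align and Translate" [1] to perform translation and alignment jointly on machine translation tasks; their work is generally regarded as the first to bring attention to NLP. Attention-based extensions of RNN models were subsequently applied to a wide range of NLP tasks. More recently, how to use attention in CNNs has also become an active research topic. The figure below shows the general trend of progress in attention research.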
