在存在异变地形的情况下,为改善地方规划而加强地方规划的深中视觉关注 (Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain)

from arxiv, Published in International Conference on Intelligent Robots and Systems (IROS) 2021 proceedings. Project website: https://sites.google.com/view/traj-constrain-visual-attn

We present a reward-predictive, model-based deep learning method featuring trajectory-constrained visual attention for local planning in visual navigation tasks. Our method learns to place visual attention at locations in latent image space which follow trajectories caused by vehicle control actions to enhance predictive accuracy during planning. The attention model is jointly optimized by the task-specific loss and an additional trajectory-constraint loss, allowing adaptability yet encouraging a regularized structure for improved generalization and reliability. Importantly, visual attention is applied in latent feature map space instead of raw image space to promote efficient planning. We validated our model in visual navigation tasks of planning low turbulence, collision-free trajectories in off-road settings and hill climbing with locking differentials in the presence of slippery terrain. Experiments involved randomized procedural generated simulation and real-world environments. We found our method improved generalization and learning efficiency when compared to no-attention and self-attention alternatives.

翻译：我们展示了一种有奖励的、基于模型的深层次学习方法,在视觉导航任务的地方规划中,以轨迹限制的视觉关注方式进行视觉关注; 我们的方法学会了将视觉关注置于潜伏的图像空间中,随着车辆控制行动所引发的轨迹提高预测准确度,从而在规划期间提高预测准确性; 关注模式因特定任务的损失和额外的轨迹限制损失而共同优化,允许适应性,但又鼓励一种正规化的结构来改进一般化和可靠性。重要的是,视觉关注应用在潜伏地图空间,而不是原始的图像空间,以促进有效的规划。我们验证了我们的视觉导航任务模式,即规划在路外环境中的低波动、无碰撞轨迹和山坡上,在滑坡的地形出现时有锁定差异。实验涉及随机化程序产生的模拟和现实世界环境。我们发现,与不注意和自我注意的替代方法相比,我们的方法提高了一般化和学习效率。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日