Human drivers use their attentional mechanisms to focus on critical objects and make decisions while driving. Since human attention can be inferred from gaze data, capturing and analyzing gaze information has emerged in recent years as a way to benefit autonomous driving technology. Previous works in this context have primarily aimed at predicting "where" human drivers look and lack knowledge of "what" objects drivers focus on. Our work bridges the gap between pixel-level and object-level attention prediction. Specifically, we propose to integrate an attention prediction module into a pretrained object detection framework and to predict attention in a grid-based style. Furthermore, critical objects are recognized based on the predicted attended-to areas. We evaluate our proposed method on two driver attention datasets, BDD-A and DR(eye)VE. Our framework achieves competitive state-of-the-art performance in attention prediction at both the pixel level and the object level, while being far more computationally efficient (75.3 fewer GFLOPs).
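To illustrate how object-level attention could be derived from a pixel- or grid-level attention map, the following is a minimal sketch of one plausible post-processing step: each detected bounding box is scored by the mean predicted attention inside it, and boxes above a threshold are flagged as critical objects. The function name, the threshold value, and the scoring rule are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np

def object_attention_scores(attention_map, boxes, threshold=0.5):
    """Hypothetical post-processing: score each detected box by the mean
    predicted attention inside it, then flag boxes above `threshold`
    as critical objects.

    attention_map : (H, W) array of predicted attention values in [0, 1]
    boxes         : list of (x1, y1, x2, y2) detections in pixel coordinates
    """
    scores = []
    for (x1, y1, x2, y2) in boxes:
        region = attention_map[y1:y2, x1:x2]
        scores.append(float(region.mean()) if region.size else 0.0)
    critical = [box for box, s in zip(boxes, scores) if s >= threshold]
    return scores, critical


# Example usage with a toy attention map and two detections.
attn = np.zeros((8, 8))
attn[2:5, 2:5] = 0.9                      # attended-to region
boxes = [(2, 2, 5, 5), (6, 6, 8, 8)]      # one attended box, one not
scores, critical = object_attention_scores(attn, boxes)
print(scores)    # e.g. [0.9, 0.0]
print(critical)  # e.g. [(2, 2, 5, 5)]
```

Averaging attention within each box is only one possible aggregation; maximum pooling or area-normalized sums would be equally reasonable choices for turning a dense attention map into per-object scores.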