The complex nature of combining localization and classification in object detection has led to a flourishing development of methods. Previous works tried to improve performance in various object detection heads but failed to present a unified view. In this paper, we present a novel dynamic head framework that unifies object detection heads with attentions. By coherently combining multiple self-attention mechanisms between feature levels for scale-awareness, among spatial locations for spatial-awareness, and within output channels for task-awareness, the proposed approach significantly improves the representation ability of object detection heads without any computational overhead. Further experiments demonstrate the effectiveness and efficiency of the proposed dynamic head on the COCO benchmark. With a standard ResNeXt-101-DCN backbone, we largely improve the performance over popular object detectors and achieve a new state of the art at 54.0 AP. Furthermore, with the latest transformer backbone and extra data, we push the current best COCO result to a new record of 60.6 AP. The code will be released at https://github.com/microsoft/DynamicHead.
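The three attentions described above can be illustrated with a minimal NumPy sketch. This is only a toy approximation under simplifying assumptions: features are a dense tensor of shape (levels, spatial positions, channels), the scale-aware attention is a sigmoid gate per level, the spatial-aware attention is plain softmax pooling (the paper uses deformable convolution, omitted here), and the task-aware attention is a clipped channel gate. All function names are illustrative, not the authors' API.

```python
import numpy as np

def scale_aware(F):
    # F has shape (L, S, C): L feature levels, S spatial positions, C channels.
    # Gate each level by a sigmoid of its global average activation.
    w = 1.0 / (1.0 + np.exp(-F.mean(axis=(1, 2))))       # (L,)
    return F * w[:, None, None]

def spatial_aware(F):
    # Toy stand-in: softmax weights over spatial positions per level
    # (the paper's version uses deformable convolution instead).
    a = F.mean(axis=2)                                    # (L, S)
    a = np.exp(a - a.max(axis=1, keepdims=True))
    a = a / a.sum(axis=1, keepdims=True)
    return F * a[:, :, None] * F.shape[1]                 # rescale to keep magnitude

def task_aware(F):
    # Channel-wise gate computed from the globally pooled feature.
    g = np.clip(0.5 + F.mean(axis=(0, 1)), 0.0, 1.0)      # (C,)
    return F * g[None, None, :]

def dynamic_head_block(F, blocks=2):
    # Apply the three attentions sequentially, and stack the block,
    # mirroring the nested form pi_C(pi_S(pi_L(F) * F) * F) * F.
    for _ in range(blocks):
        F = task_aware(spatial_aware(scale_aware(F)))
    return F

rng = np.random.default_rng(0)
F = rng.standard_normal((4, 16, 8))   # 4 levels, 16 positions, 8 channels
out = dynamic_head_block(F)
print(out.shape)                      # (4, 16, 8): shape is preserved
```

Because each attention only rescales the tensor along one axis, the output keeps the input shape, so the block can be stacked repeatedly and dropped into an existing detection head.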