GiraffeDet:用于探测物体的重颈范式 (GiraffeDet: A Heavy-Neck Paradigm for Object Detection) - 专知论文

会员服务 ·

0

GiraffeDet · INFORMS · 目标检测 · Backbone · Neck ·

2022 年 2 月 9 日

GiraffeDet: A Heavy-Neck Paradigm for Object Detection

翻译：GiraffeDet:用于探测物体的重颈范式

Yiqi Jiang,Zhiyu Tan,Junyan Wang,Xiuyu Sun,Ming Lin,Hao Li

In conventional object detection frameworks, a backbone body inherited from image recognition models extracts deep latent features and then a neck module fuses these latent features to capture information at different scales. As the resolution in object detection is much larger than in image recognition, the computational cost of the backbone often dominates the total inference cost. This heavy-backbone design paradigm is mostly due to the historical legacy when transferring image recognition models to object detection rather than an end-to-end optimized design for object detection. In this work, we show that such paradigm indeed leads to sub-optimal object detection models. To this end, we propose a novel heavy-neck paradigm, GiraffeDet, a giraffe-like network for efficient object detection. The GiraffeDet uses an extremely lightweight backbone and a very deep and large neck module which encourages dense information exchange among different spatial scales as well as different levels of latent semantics simultaneously. This design paradigm allows detectors to process the high-level semantic information and low-level spatial information at the same priority even in the early stage of the network, making it more effective in detection tasks. Numerical evaluations on multiple popular object detection benchmarks show that GiraffeDet consistently outperforms previous SOTA models across a wide spectrum of resource constraints.

翻译：在常规物体探测框架中,一个从图像识别模型继承的骨干体从图像识别模型中提取深潜性特征,然后一个颈部模块将这些潜在特征结合到不同尺度的信息中。由于物体检测中的分辨率比图像识别中的分辨率大得多,因此主干体的计算成本往往占总推断成本的主导。这种重背骨设计范式主要是由于将图像识别模型转移到目标检测而不是最终至最终最佳天体探测设计的历史遗留问题。在这项工作中,我们表明这种范式确实导致次优化天体检测模型。为此,我们提出了一个新的重身范式,即GiraffeDet,即一个类似长颈鹿的网络,用于高效天体检测。GiraffeDet使用一个极轻的脊椎和一个非常深和大型的颈部模块,鼓励在不同空间尺度之间以及不同水平的潜伏语义学上进行密集的信息交流。这一设计范式使探测器能够处理高层次的语义信息和低层次空间空间信息,甚至在网络的早期阶段,从而使其在探测目标探测任务中更加有效。

0

相关内容

GiraffeDet

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

专知会员服务

67+阅读 · 2022年3月29日

【CVPR 2022】C2AM损失：为长尾目标检测任务追求更好的决策边界，C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection

【CVPR 2022】C2AM损失：为长尾目标检测任务追求更好的决策边界，C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection

专知会员服务

7+阅读 · 2022年3月19日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【目标检测 | 2019最新综述】目标检测的最新进展，附40页PDF，Recent Advances in Deep Learning for Object Detection

【目标检测 | 2019最新综述】目标检测的最新进展，附40页PDF，Recent Advances in Deep Learning for Object Detection

专知会员服务

85+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

基于深度信念网络的高光谱遥感影像变化检测方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

多部门机构下的生产规划与资源配置

国家自然科学基金

3+阅读 · 2014年12月31日

基于概率本体的CPS入侵检测方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

面向X-CT应用的(Ce, Lu)3(Cr, Al)5O12闪烁陶瓷中过渡金属离子的光谱展宽效应研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于中红外差频光源的高精度二氧化碳同位素探测

国家自然科学基金

0+阅读 · 2012年12月31日

红外弱小高机动目标实时检测与跟踪方法研究

国家自然科学基金

4+阅读 · 2012年12月31日

基于信息融合核框架的多时相遥感影像特征级变化检测研究

国家自然科学基金

1+阅读 · 2012年12月31日

天基多基地MIMO雷达动目标检测方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

信息物理融合的Web对象可视检索技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于冗余多尺度表示的小框架核回归方法研究及在光谱反射率重建中的应用

国家自然科学基金

0+阅读 · 2009年12月31日

Global-and-Local Collaborative Learning for Co-Salient Object Detection

Arxiv

0+阅读 · 2022年4月19日

Detect-and-describe: Joint learning framework for detection and description of objects

Arxiv

0+阅读 · 2022年4月19日

Point-Level Region Contrast for Object Detection Pre-Training

Arxiv

1+阅读 · 2022年4月19日

CenterNet++ for Object Detection

Arxiv

0+阅读 · 2022年4月18日

Salient Objects in Clutter

Salient Objects in Clutter

Arxiv

0+阅读 · 2022年4月18日

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

Arxiv

0+阅读 · 2022年4月18日

Feature Compression for Rate Constrained Object Detection on the Edge

Arxiv

0+阅读 · 2022年4月15日

Attention Bottlenecks for Multimodal Fusion

Arxiv

31+阅读 · 2021年6月30日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Arxiv

19+阅读 · 2018年1月27日

VIP会员

文章信息

相关主题

相关VIP内容

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

专知会员服务

67+阅读 · 2022年3月29日

【CVPR 2022】C2AM损失：为长尾目标检测任务追求更好的决策边界，C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection

【CVPR 2022】C2AM损失：为长尾目标检测任务追求更好的决策边界，C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection

专知会员服务

7+阅读 · 2022年3月19日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【目标检测 | 2019最新综述】目标检测的最新进展，附40页PDF，Recent Advances in Deep Learning for Object Detection

【目标检测 | 2019最新综述】目标检测的最新进展，附40页PDF，Recent Advances in Deep Learning for Object Detection

专知会员服务

85+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

相关论文

Global-and-Local Collaborative Learning for Co-Salient Object Detection

Arxiv

0+阅读 · 2022年4月19日

Detect-and-describe: Joint learning framework for detection and description of objects

Arxiv

0+阅读 · 2022年4月19日

Point-Level Region Contrast for Object Detection Pre-Training

Arxiv

1+阅读 · 2022年4月19日

CenterNet++ for Object Detection

Arxiv

0+阅读 · 2022年4月18日

Salient Objects in Clutter

Salient Objects in Clutter

Arxiv

0+阅读 · 2022年4月18日

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

Arxiv

0+阅读 · 2022年4月18日

Feature Compression for Rate Constrained Object Detection on the Edge

Arxiv

0+阅读 · 2022年4月15日

Attention Bottlenecks for Multimodal Fusion

Arxiv

31+阅读 · 2021年6月30日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Arxiv

19+阅读 · 2018年1月27日

相关基金

基于深度信念网络的高光谱遥感影像变化检测方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

多部门机构下的生产规划与资源配置

国家自然科学基金

3+阅读 · 2014年12月31日

基于概率本体的CPS入侵检测方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

面向X-CT应用的(Ce, Lu)3(Cr, Al)5O12闪烁陶瓷中过渡金属离子的光谱展宽效应研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于中红外差频光源的高精度二氧化碳同位素探测

国家自然科学基金

0+阅读 · 2012年12月31日

红外弱小高机动目标实时检测与跟踪方法研究

国家自然科学基金

4+阅读 · 2012年12月31日

基于信息融合核框架的多时相遥感影像特征级变化检测研究

国家自然科学基金

1+阅读 · 2012年12月31日

天基多基地MIMO雷达动目标检测方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

信息物理融合的Web对象可视检索技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于冗余多尺度表示的小框架核回归方法研究及在光谱反射率重建中的应用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员