ODAM：基于梯度的目标检测单例视觉解释方法 (ODAM: Gradient-based instance-specific visual explanations for object detection) - 专知论文

会员服务 ·

0

检测器 · 基于梯度的 · 梯度 · 目标检测 · 类激活图 ·

2023 年 4 月 13 日

ODAM: Gradient-based instance-specific visual explanations for object detection

翻译：ODAM：基于梯度的目标检测单例视觉解释方法

Chenyang Zhao,Antoni B. Chan

from arxiv, 2023 International Conference on Learning Representations

We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visualized explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. Compared to previous works classification activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to both one-stage detectors and two-stage detectors with different types of detector backbones and heads, and produces higher-quality visual explanations than the state-of-the-art both effectively and efficiently. We next propose a training scheme, Odam-Train, to improve the explanation ability on object discrimination of the detector through encouraging consistency between explanations for detections on the same object, and distinct explanations for detections on different objects. Based on the heat maps produced by ODAM with Odam-Train, we propose Odam-NMS, which considers the information of the model's explanation for each prediction to distinguish the duplicate detected objects. We present a detailed analysis of the visualized explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM.

翻译：我们提出了基于梯度的Object Detector Activation Maps（ODAM），一种用于解释目标检测器的预测的可视化技术。ODAM利用流向中间特征图的检测器目标的梯度，生成热力图，显示对每个预测属性的检测器决策的区域影响。与之前的分类激活图（CAM）相比，ODAM生成的是单个目标的解释而不是类别特异性的解释。我们展示了ODAM适用于不同类型的检测器骨干和头的单阶段检测器和双阶段检测器，并且在效果和效率上比最先进的技术都更优秀。接下来，我们提出了一个训练方案Odam-Train，通过鼓励同一对象的检测的解释之间的一致性和不同对象的检测的区别来提高检测器对目标识别的解释能力。基于ODAM和Odam-Train生成的热力图，我们提出了Odam-NMS，该方法考虑模型对每个预测的解释信息，以区分重复的检测目标。我们对检测器的可视化解释进行了详细的分析，并进行了广泛的实验来验证ODAM的有效性。

0

相关内容

检测器

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

专知会员服务

28+阅读 · 2022年3月3日

【AAAI2022】基于秩模仿和预测引导特征模仿的目标检测知识蒸馏

【AAAI2022】基于秩模仿和预测引导特征模仿的目标检测知识蒸馏

专知会员服务

24+阅读 · 2021年12月12日

【ICCV2021】用于目标检测和实例分割的新损失函数

专知会员服务

22+阅读 · 2021年7月28日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

【上海交大】可解释CNN的对象分类，Interpretable CNNs for Object Classification

专知会员服务

54+阅读 · 2020年3月14日

【CVPR2020-加州理工大学Devi Parikh】多任务视觉和语言表示学习

【CVPR2020-加州理工大学Devi Parikh】多任务视觉和语言表示学习

专知会员服务

38+阅读 · 2020年2月25日

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

专知会员服务

16+阅读 · 2019年12月18日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

专知会员服务

29+阅读 · 2019年6月16日

首个目标检测扩散模型，比Faster R-CNN、DETR好，从随机框中直接检测

首个目标检测扩散模型，比Faster R-CNN、DETR好，从随机框中直接检测

机器之心

1+阅读 · 2022年11月21日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

论文笔记之Feature Selective Networks for Object Detection

论文笔记之Feature Selective Networks for Object Detection

统计学习与视觉计算组

21+阅读 · 2018年7月26日

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

AI研习社

10+阅读 · 2018年5月12日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

稀有变异的有效发现与识别

国家自然科学基金

0+阅读 · 2013年12月31日

多约束条件下的目标超分辨检测方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

UGT1A新等位基因功能的研究及UGT1A基因多态性与伊立替康疗效和毒性的关系

国家自然科学基金

0+阅读 · 2013年12月31日

基于阴影恢复技术的SAR三维重建与目标检测方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

水下障碍物声波探测与识别的拓扑灵敏度方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

PTBP1介导的survivinΔEx3过表达调控胶质母细胞瘤微血管增生的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于粘滑原理的球形检测机器人原地转向运动时变滑模控制方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

非参数CFAR检测理论及应用

国家自然科学基金

0+阅读 · 2011年12月31日

循环间充质干细胞的成骨潜能及机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

组蛋白修饰调控异染色质边界的分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

Multi-modal Queried Object Detection in the Wild

Arxiv

0+阅读 · 2023年5月30日

Towards Visual Saliency Explanations of Face Recognition

Arxiv

0+阅读 · 2023年5月30日

Learning Weakly Supervised Audio-Visual Violence Detection in Hyperbolic Space

Arxiv

0+阅读 · 2023年5月30日

CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models

Arxiv

0+阅读 · 2023年5月29日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation

Arxiv

11+阅读 · 2021年12月9日

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection

Arxiv

13+阅读 · 2020年12月3日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Arxiv

14+阅读 · 2018年9月6日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Arxiv

11+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

基于梯度的

相关VIP内容

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

专知会员服务

28+阅读 · 2022年3月3日

【AAAI2022】基于秩模仿和预测引导特征模仿的目标检测知识蒸馏

【AAAI2022】基于秩模仿和预测引导特征模仿的目标检测知识蒸馏

专知会员服务

24+阅读 · 2021年12月12日

【ICCV2021】用于目标检测和实例分割的新损失函数

专知会员服务

22+阅读 · 2021年7月28日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

【上海交大】可解释CNN的对象分类，Interpretable CNNs for Object Classification

专知会员服务

54+阅读 · 2020年3月14日

【CVPR2020-加州理工大学Devi Parikh】多任务视觉和语言表示学习

【CVPR2020-加州理工大学Devi Parikh】多任务视觉和语言表示学习

专知会员服务

38+阅读 · 2020年2月25日

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

专知会员服务

16+阅读 · 2019年12月18日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

专知会员服务

29+阅读 · 2019年6月16日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

首个目标检测扩散模型，比Faster R-CNN、DETR好，从随机框中直接检测

首个目标检测扩散模型，比Faster R-CNN、DETR好，从随机框中直接检测

机器之心

1+阅读 · 2022年11月21日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

论文笔记之Feature Selective Networks for Object Detection

论文笔记之Feature Selective Networks for Object Detection

统计学习与视觉计算组

21+阅读 · 2018年7月26日

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

「目标检测算法」连连看：从 Faster R-CNN 、 R-FCN 到 FPN

AI研习社

10+阅读 · 2018年5月12日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

Multi-modal Queried Object Detection in the Wild

Arxiv

0+阅读 · 2023年5月30日

Towards Visual Saliency Explanations of Face Recognition

Arxiv

0+阅读 · 2023年5月30日

Learning Weakly Supervised Audio-Visual Violence Detection in Hyperbolic Space

Arxiv

0+阅读 · 2023年5月30日

CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models

Arxiv

0+阅读 · 2023年5月29日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation

Arxiv

11+阅读 · 2021年12月9日

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection

Arxiv

13+阅读 · 2020年12月3日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Arxiv

14+阅读 · 2018年9月6日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Arxiv

11+阅读 · 2018年3月28日

相关基金

稀有变异的有效发现与识别

国家自然科学基金

0+阅读 · 2013年12月31日

多约束条件下的目标超分辨检测方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

UGT1A新等位基因功能的研究及UGT1A基因多态性与伊立替康疗效和毒性的关系

国家自然科学基金

0+阅读 · 2013年12月31日

基于阴影恢复技术的SAR三维重建与目标检测方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

水下障碍物声波探测与识别的拓扑灵敏度方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

PTBP1介导的survivinΔEx3过表达调控胶质母细胞瘤微血管增生的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于粘滑原理的球形检测机器人原地转向运动时变滑模控制方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

非参数CFAR检测理论及应用

国家自然科学基金

0+阅读 · 2011年12月31日

循环间充质干细胞的成骨潜能及机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

组蛋白修饰调控异染色质边界的分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员