GTNet:用于检测人体物体相互作用的指导变异器网络 (GTNet:Guided Transformer Network for Detecting Human-Object Interactions) - 专知论文

会员服务 ·

0

INTERACT · Networking · 变换 · INFORMS · Pair ·

2021 年 8 月 3 日

GTNet:Guided Transformer Network for Detecting Human-Object Interactions

翻译：GTNet:用于检测人体物体相互作用的指导变异器网络

A S M Iftekhar,Satish Kumar,R. Austin McEver,Suya You,B. S. Manjunath

from arxiv, pre-print, the work is in progress

The human-object interaction (HOI) detection task refers to localizing humans, localizing objects, and predicting the interactions between each human-object pair. HOI is considered one of the fundamental steps in truly understanding complex visual scenes. For detecting HOI, it is important to utilize relative spatial configurations and object semantics to find salient spatial regions of images that highlight the interactions between human object pairs. This issue is addressed by the proposed self-attention based guided transformer network, GTNet. GTNet encodes this spatial contextual information in human and object visual features via self-attention while achieving a 4%-6% improvement over previous state of the art results on both the V-COCO and HICO-DET datasets. Code will be made available online.

翻译：人体- 物体互动( HOI) 检测任务是指将人类本地化、物体本地化和预测每个人体- 对象对应方之间的相互作用。 HOI 被视为真正理解复杂视觉场景的基本步骤之一。为了检测 HOI, 使用相对空间配置和物体语义来寻找突出显示人类对象对子之间相互作用的图像的显著空间区域非常重要。这个问题由拟议的基于自我注意的引导变压器网络GTNet 来解决。 GTNet 通过自我注意将人类和物体视觉特征的空间背景信息编码为人类和物体视觉特征,同时比 V- COCO 和 HICO- DET 数据集以往的艺术成果提高4%-6%。代码将在线提供。

0

相关内容

INTERACT

IFIP TC13 Conference on Human-Computer Interaction是人机交互领域的研究者和实践者展示其工作的重要平台。多年来，这些会议吸引了来自几个国家和文化的研究人员。官网链接：http://interact2019.org/

【CVPR2021】用于目标检测的通用实例蒸馏

【CVPR2021】用于目标检测的通用实例蒸馏

专知会员服务

24+阅读 · 2021年3月22日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

专知会员服务

17+阅读 · 2020年4月15日

【北卡罗莱纳州立大学】单场景视频异常检测综述，A Survey of Single-Scene Video Anomaly Detection

【北卡罗莱纳州立大学】单场景视频异常检测综述，A Survey of Single-Scene Video Anomaly Detection

专知会员服务

31+阅读 · 2020年4月13日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

【厦门大学-CVPR2020】协调可迁移性与可判别性的自适应目标检测器，Adapting Object Detectors

【厦门大学-CVPR2020】协调可迁移性与可判别性的自适应目标检测器，Adapting Object Detectors

专知会员服务

26+阅读 · 2020年3月16日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

CVPR2019| 05-16更新10篇论文及代码合集（含一篇oral，全景分割/文本检测/目标检测等）

CVPR2019| 05-16更新10篇论文及代码合集（含一篇oral，全景分割/文本检测/目标检测等）

极市平台

14+阅读 · 2019年5月16日

Github项目推荐 | 全景分割相关资源列表

Github项目推荐 | 全景分割相关资源列表

AI研习社

9+阅读 · 2019年5月13日

CVPR2019| 05-07更新14篇论文及代码合集（1篇oral，含目标检测/视频分割/目标跟踪等）

CVPR2019| 05-07更新14篇论文及代码合集（1篇oral，含目标检测/视频分割/目标跟踪等）

极市平台

22+阅读 · 2019年5月7日

CVPR2019| 04-08更新19篇论文及代码（1篇oral、目标检测、行人检测、视频超分辨等）

CVPR2019| 04-08更新19篇论文及代码（1篇oral、目标检测、行人检测、视频超分辨等）

极市平台

19+阅读 · 2019年4月8日

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

AI研习社

32+阅读 · 2019年4月5日

CVPR2019 | 03-25日更新12篇论文及代码汇总（目标检测、姿态估计、跟踪、VQA等）

CVPR2019 | 03-25日更新12篇论文及代码汇总（目标检测、姿态估计、跟踪、VQA等）

极市平台

5+阅读 · 2019年3月25日

CVPR2019 | 全景分割：Attention-guided Unified Network

CVPR2019 | 全景分割：Attention-guided Unified Network

极市平台

9+阅读 · 2019年3月3日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

OadTR: Online Action Detection with Transformers

Arxiv

7+阅读 · 2021年6月21日

Adaptive Graph Convolutional Network with Attention Graph Clustering for Co-saliency Detection

Adaptive Graph Convolutional Network with Attention Graph Clustering for Co-saliency Detection

Arxiv

10+阅读 · 2020年3月13日

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

Arxiv

7+阅读 · 2020年3月11日

3D Backbone Network for 3D Object Detection

Arxiv

12+阅读 · 2019年1月24日

Self-Attention Recurrent Network for Saliency Detection

Self-Attention Recurrent Network for Saliency Detection

Arxiv

5+阅读 · 2018年8月5日

Relation Networks for Object Detection

Arxiv

4+阅读 · 2018年6月14日

DetNet: A Backbone network for Object Detection

Arxiv

5+阅读 · 2018年4月17日

Zero-Shot Detection

Arxiv

7+阅读 · 2018年3月19日

An Attention-Based Word-Level Interaction Model: Relation Detection for Knowledge Base Question Answering

Arxiv

6+阅读 · 2018年1月30日

Natural Language Guided Visual Relationship Detection

Arxiv

3+阅读 · 2017年11月21日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2021】用于目标检测的通用实例蒸馏

【CVPR2021】用于目标检测的通用实例蒸馏

专知会员服务

24+阅读 · 2021年3月22日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

专知会员服务

17+阅读 · 2020年4月15日

【北卡罗莱纳州立大学】单场景视频异常检测综述，A Survey of Single-Scene Video Anomaly Detection

【北卡罗莱纳州立大学】单场景视频异常检测综述，A Survey of Single-Scene Video Anomaly Detection

专知会员服务

31+阅读 · 2020年4月13日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

【厦门大学-CVPR2020】协调可迁移性与可判别性的自适应目标检测器，Adapting Object Detectors

【厦门大学-CVPR2020】协调可迁移性与可判别性的自适应目标检测器，Adapting Object Detectors

专知会员服务

26+阅读 · 2020年3月16日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Deep Research（深度研究）：系统性综述

《革新战术战场空间能力：反无人机系统》报告

【普林斯顿博士论文】用于语音的生成式通用模型

螺旋式开发作为战略资产：美军启示

相关资讯

CVPR2019| 05-16更新10篇论文及代码合集（含一篇oral，全景分割/文本检测/目标检测等）

CVPR2019| 05-16更新10篇论文及代码合集（含一篇oral，全景分割/文本检测/目标检测等）

极市平台

14+阅读 · 2019年5月16日

Github项目推荐 | 全景分割相关资源列表

Github项目推荐 | 全景分割相关资源列表

AI研习社

9+阅读 · 2019年5月13日

CVPR2019| 05-07更新14篇论文及代码合集（1篇oral，含目标检测/视频分割/目标跟踪等）

CVPR2019| 05-07更新14篇论文及代码合集（1篇oral，含目标检测/视频分割/目标跟踪等）

极市平台

22+阅读 · 2019年5月7日

CVPR2019| 04-08更新19篇论文及代码（1篇oral、目标检测、行人检测、视频超分辨等）

CVPR2019| 04-08更新19篇论文及代码（1篇oral、目标检测、行人检测、视频超分辨等）

极市平台

19+阅读 · 2019年4月8日

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

AI研习社

32+阅读 · 2019年4月5日

CVPR2019 | 03-25日更新12篇论文及代码汇总（目标检测、姿态估计、跟踪、VQA等）

CVPR2019 | 03-25日更新12篇论文及代码汇总（目标检测、姿态估计、跟踪、VQA等）

极市平台

5+阅读 · 2019年3月25日

CVPR2019 | 全景分割：Attention-guided Unified Network

CVPR2019 | 全景分割：Attention-guided Unified Network

极市平台

9+阅读 · 2019年3月3日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

Single-Shot Object Detection with Enriched Semantics

Single-Shot Object Detection with Enriched Semantics

统计学习与视觉计算组

14+阅读 · 2018年8月29日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

OadTR: Online Action Detection with Transformers

Arxiv

7+阅读 · 2021年6月21日

Adaptive Graph Convolutional Network with Attention Graph Clustering for Co-saliency Detection

Adaptive Graph Convolutional Network with Attention Graph Clustering for Co-saliency Detection

Arxiv

10+阅读 · 2020年3月13日

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

Arxiv

7+阅读 · 2020年3月11日

3D Backbone Network for 3D Object Detection

Arxiv

12+阅读 · 2019年1月24日

Self-Attention Recurrent Network for Saliency Detection

Self-Attention Recurrent Network for Saliency Detection

Arxiv

5+阅读 · 2018年8月5日

Relation Networks for Object Detection

Arxiv

4+阅读 · 2018年6月14日

DetNet: A Backbone network for Object Detection

Arxiv

5+阅读 · 2018年4月17日

Zero-Shot Detection

Arxiv

7+阅读 · 2018年3月19日

An Attention-Based Word-Level Interaction Model: Relation Detection for Knowledge Base Question Answering

Arxiv

6+阅读 · 2018年1月30日

Natural Language Guided Visual Relationship Detection

Arxiv

3+阅读 · 2017年11月21日

微信扫码咨询专知VIP会员