QAHOI: 以查询为基础的人体物体交互探测锁定器 (QAHOI: Query-Based Anchors for Human-Object Interaction Detection) - 专知论文

会员服务 ·

0

INTERACT · anchor · Backbone · HTTPS · state-of-the-art ·

2021 年 12 月 16 日

QAHOI: Query-Based Anchors for Human-Object Interaction Detection

翻译：QAHOI: 以查询为基础的人体物体交互探测锁定器

Junwen Chen,Keiji Yanai

Human-object interaction (HOI) detection as a downstream of object detection tasks requires localizing pairs of humans and objects and extracting the semantic relationships between humans and objects from an image. Recently, one-stage approaches have become a new trend for this task due to their high efficiency. However, these approaches focus on detecting possible interaction points or filtering human-object pairs, ignoring the variability in the location and size of different objects at spatial scales. To address this problem, we propose a transformer-based method, QAHOI (Query-Based Anchors for Human-Object Interaction detection), which leverages a multi-scale architecture to extract features from different spatial scales and uses query-based anchors to predict all the elements of an HOI instance. We further investigate that a powerful backbone significantly increases accuracy for QAHOI, and QAHOI with a transformer-based backbone outperforms recent state-of-the-art methods by large margins on the HICO-DET benchmark. The source code is available at $\href{https://github.com/cjw2021/QAHOI}{\text{this https URL}}$.

翻译：人类物体相互作用(HOI)检测作为物体探测任务下游的物体探测任务,需要将人类和物体的对子定位,并从图像中提取人与物体之间的语义关系。最近,由于效率高,一阶段方法已成为这项任务的新趋势。然而,这些方法侧重于探测可能的相互作用点或过滤人体物体对子,忽视空间尺度上不同物体的位置和大小的变异性。为了解决这一问题,我们提议采用变压器法,QAHOI(人类物体相互作用探测的以查询为基础的锚),利用多尺度结构从不同的空间尺度上提取特征,并使用基于查询的锚来预测HOI实例的所有元素。我们进一步调查,强大的脊椎大大提高了QAHOHOI和具有以变压器为主脊的QAHOI的精度,在HICO-DET基准上以大边距显示最近的状态-艺术方法。源代码可在 $href$_hus_QHOImbU. /QULAchus_Qrus.commexus.

0

相关内容

INTERACT

IFIP TC13 Conference on Human-Computer Interaction是人机交互领域的研究者和实践者展示其工作的重要平台。多年来，这些会议吸引了来自几个国家和文化的研究人员。官网链接：http://interact2019.org/

【AAAI2022】锚点DETR：基于transformer检测器的查询设计

【AAAI2022】锚点DETR：基于transformer检测器的查询设计

专知会员服务

12+阅读 · 2021年12月31日

【ICML2020-斯坦福Facebook-何恺明】神经网络图结构，Graph Structure of Neural Networks

【ICML2020-斯坦福Facebook-何恺明】神经网络图结构，Graph Structure of Neural Networks

专知会员服务

55+阅读 · 2020年7月14日

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

35+阅读 · 2020年7月3日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

30+阅读 · 2020年5月12日

CVPR 2020 论文开源项目合集

专知会员服务

109+阅读 · 2020年3月12日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

133+阅读 · 2020年3月8日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

17+阅读 · 2019年11月30日

【ICCV2019教程】物体检测的R-CNN通用框架，The Generalized R-CNN Framework for Object Detection，180页ppt，Facebook 人工智能研究院Ross Girshick大神

【ICCV2019教程】物体检测的R-CNN通用框架，The Generalized R-CNN Framework for Object Detection，180页ppt，Facebook 人工智能研究院Ross Girshick大神

专知会员服务

23+阅读 · 2019年11月16日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

最快人脸检测遇敌手！ZQCNN vs libfacedetection

最快人脸检测遇敌手！ZQCNN vs libfacedetection

极市平台

19+阅读 · 2019年3月27日

人脸检测库：libfacedetection

人脸检测库：libfacedetection

Python程序员

15+阅读 · 2019年3月22日

Github项目推荐 | 比快更快！速度超越OpenCV的人脸检测库 libfacedetection 开源！

Github项目推荐 | 比快更快！速度超越OpenCV的人脸检测库 libfacedetection 开源！

AI研习社

10+阅读 · 2019年3月18日

CVPR2019 | 目标检测新文：Generalized Intersection over Union

CVPR2019 | 目标检测新文：Generalized Intersection over Union

极市平台

8+阅读 · 2019年2月28日

已删除

将门创投

4+阅读 · 2018年11月20日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

Mask R-CNN 源代码终上线，Facebook 开源目标检测平台—Detectron

Mask R-CNN 源代码终上线，Facebook 开源目标检测平台—Detectron

AI100

7+阅读 · 2018年1月24日

论文浅尝 | Improved Neural Relation Detection for KBQA

论文浅尝 | Improved Neural Relation Detection for KBQA

开放知识图谱

13+阅读 · 2018年1月21日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

Anchor DETR: Query Design for Transformer-Based Detector

Arxiv

5+阅读 · 2021年9月15日

General Instance Distillation for Object Detection

Arxiv

9+阅读 · 2021年3月3日

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

Arxiv

4+阅读 · 2020年12月31日

Query Understanding via Intent Description Generation

Arxiv

9+阅读 · 2020年8月25日

Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

Arxiv

17+阅读 · 2020年3月31日

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

Arxiv

7+阅读 · 2020年3月11日

FoveaBox: Beyond Anchor-based Object Detector

Arxiv

5+阅读 · 2019年4月8日

Augmentation for small object detection

Augmentation for small object detection

Arxiv

11+阅读 · 2019年2月19日

Zero-Shot Detection

Arxiv

6+阅读 · 2018年3月19日

Arxiv

7+阅读 · 2018年1月24日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【AAAI2022】锚点DETR：基于transformer检测器的查询设计

【AAAI2022】锚点DETR：基于transformer检测器的查询设计

专知会员服务

12+阅读 · 2021年12月31日

【ICML2020-斯坦福Facebook-何恺明】神经网络图结构，Graph Structure of Neural Networks

【ICML2020-斯坦福Facebook-何恺明】神经网络图结构，Graph Structure of Neural Networks

专知会员服务

55+阅读 · 2020年7月14日

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

35+阅读 · 2020年7月3日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

30+阅读 · 2020年5月12日

CVPR 2020 论文开源项目合集

专知会员服务

109+阅读 · 2020年3月12日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

133+阅读 · 2020年3月8日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

17+阅读 · 2019年11月30日

【ICCV2019教程】物体检测的R-CNN通用框架，The Generalized R-CNN Framework for Object Detection，180页ppt，Facebook 人工智能研究院Ross Girshick大神

【ICCV2019教程】物体检测的R-CNN通用框架，The Generalized R-CNN Framework for Object Detection，180页ppt，Facebook 人工智能研究院Ross Girshick大神

专知会员服务

23+阅读 · 2019年11月16日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

热门VIP内容

相关资讯

最快人脸检测遇敌手！ZQCNN vs libfacedetection

最快人脸检测遇敌手！ZQCNN vs libfacedetection

极市平台

19+阅读 · 2019年3月27日

人脸检测库：libfacedetection

人脸检测库：libfacedetection

Python程序员

15+阅读 · 2019年3月22日

Github项目推荐 | 比快更快！速度超越OpenCV的人脸检测库 libfacedetection 开源！

Github项目推荐 | 比快更快！速度超越OpenCV的人脸检测库 libfacedetection 开源！

AI研习社

10+阅读 · 2019年3月18日

CVPR2019 | 目标检测新文：Generalized Intersection over Union

CVPR2019 | 目标检测新文：Generalized Intersection over Union

极市平台

8+阅读 · 2019年2月28日

已删除

将门创投

4+阅读 · 2018年11月20日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

Mask R-CNN 源代码终上线，Facebook 开源目标检测平台—Detectron

Mask R-CNN 源代码终上线，Facebook 开源目标检测平台—Detectron

AI100

7+阅读 · 2018年1月24日

论文浅尝 | Improved Neural Relation Detection for KBQA

论文浅尝 | Improved Neural Relation Detection for KBQA

开放知识图谱

13+阅读 · 2018年1月21日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

Anchor DETR: Query Design for Transformer-Based Detector

Arxiv

5+阅读 · 2021年9月15日

General Instance Distillation for Object Detection

Arxiv

9+阅读 · 2021年3月3日

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

Arxiv

4+阅读 · 2020年12月31日

Query Understanding via Intent Description Generation

Arxiv

9+阅读 · 2020年8月25日

Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

Arxiv

17+阅读 · 2020年3月31日

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

Arxiv

7+阅读 · 2020年3月11日

FoveaBox: Beyond Anchor-based Object Detector

Arxiv

5+阅读 · 2019年4月8日

Augmentation for small object detection

Augmentation for small object detection

Arxiv

11+阅读 · 2019年2月19日

Zero-Shot Detection

Arxiv

6+阅读 · 2018年3月19日

Arxiv

7+阅读 · 2018年1月24日

微信扫码咨询专知VIP会员