Identifying relations between objects is central to understanding a scene. While several works have addressed relation modeling in the image domain, progress in the video domain has been constrained by the challenging dynamics of spatio-temporal interactions (e.g., between which objects is there an interaction? when does a relation start and end?). To date, two representative approaches have been proposed to tackle Video Visual Relation Detection (VidVRD): segment-based and window-based. We first point out the limitations of these methods and then propose a novel approach named Temporal Span Proposal Network (TSPN). TSPN tells what to look at: it sparsifies the relation search space by scoring the relationness of each object pair, i.e., how likely a relation exists. TSPN tells when to look: it simultaneously predicts the start-end timestamps (i.e., temporal spans) and the categories of all relation candidates by exploiting the full video context. These two designs yield a win-win scenario: TSPN accelerates training by 2x or more over existing methods while achieving competitive performance on two VidVRD benchmarks (ImageNet-VidVRD and VidOR). Moreover, comprehensive ablative experiments demonstrate the effectiveness of our approach. Code is available at https://github.com/sangminwoo/Temporal-Span-Proposal-Network-VidVRD.
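To make the two design ideas concrete, here is a minimal, hypothetical sketch (not the authors' implementation; module names, feature dimensions, the keep ratio, and the number of predicate classes are illustrative assumptions) of how relationness scoring can sparsify object pairs and how a span head plus a predicate head can be predicted jointly from video-level pair features.

```python
# Hypothetical sketch of TSPN's two ideas, not the released code:
# (1) "what to look at": score the relationness of every subject-object pair
#     and keep only the top-scoring pairs, sparsifying the relation search space;
# (2) "when to look": for each kept pair, predict a temporal span (start/end,
#     normalized to [0, 1]) and predicate logits from full-video pair features.
import torch
import torch.nn as nn


class TSPNSketch(nn.Module):
    def __init__(self, feat_dim=512, num_predicates=132, keep_ratio=0.2):
        super().__init__()
        self.keep_ratio = keep_ratio
        # Scores how likely a relation exists for a subject-object pair.
        self.relationness = nn.Sequential(
            nn.Linear(2 * feat_dim, feat_dim), nn.ReLU(), nn.Linear(feat_dim, 1)
        )
        # Predicts (start, end) in [0, 1] relative to the video length
        # (ordering of start/end is left unconstrained in this sketch).
        self.span_head = nn.Sequential(
            nn.Linear(2 * feat_dim, feat_dim), nn.ReLU(),
            nn.Linear(feat_dim, 2), nn.Sigmoid()
        )
        # Predicts predicate logits for the pair.
        self.predicate_head = nn.Linear(2 * feat_dim, num_predicates)

    def forward(self, pair_feats):
        # pair_feats: (num_pairs, 2 * feat_dim), concatenated subject/object features.
        scores = self.relationness(pair_feats).squeeze(-1)   # (num_pairs,)
        k = max(1, int(self.keep_ratio * pair_feats.size(0)))
        top_scores, top_idx = scores.topk(k)                  # sparsify: keep top-k pairs
        kept = pair_feats[top_idx]
        spans = self.span_head(kept)                          # (k, 2): start/end in [0, 1]
        predicates = self.predicate_head(kept)                # (k, num_predicates)
        return top_idx, top_scores, spans, predicates


# Usage: 20 candidate pairs, each with concatenated 512-d subject/object features.
model = TSPNSketch()
pairs = torch.randn(20, 2 * 512)
idx, rel_scores, spans, pred_logits = model(pairs)
print(idx.shape, spans.shape, pred_logits.shape)  # [4], [4, 2], [4, 132]
```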