GateHUB: Gated History Unit with Background Suppression for Online Action Detection

June 9, 2022

Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen

From arXiv; CVPR 2022

Online action detection is the task of predicting the action as soon as it happens in a streaming video. A major challenge is that the model does not have access to the future and has to solely rely on the history, i.e., the frames observed so far, to make predictions. It is therefore important to accentuate parts of the history that are more informative to the prediction of the current frame. We present GateHUB, Gated History Unit with Background Suppression, that comprises a novel position-guided gated cross-attention mechanism to enhance or suppress parts of the history as per how informative they are for current frame prediction. GateHUB further proposes Future-augmented History (FaH) to make history features more informative by using subsequently observed frames when available. In a single unified framework, GateHUB integrates the transformer's ability of long-range temporal modeling and the recurrent model's capacity to selectively encode relevant information. GateHUB also introduces a background suppression objective to further mitigate false positive background frames that closely resemble the action frames. Extensive validation on three benchmark datasets, THUMOS, TVSeries, and HDD, demonstrates that GateHUB significantly outperforms all existing methods and is also more efficient than the existing best work. Furthermore, a flow-free version of GateHUB is able to achieve higher or close accuracy at 2.8x higher frame rate compared to all existing methods that require both RGB and optical flow information for prediction.
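The gating idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes one simple way to realize a gate, namely adding the log of a sigmoid gate to each attention logit before the softmax, so that a history frame whose gate is near zero receives almost no attention. The function names, the linear gate parameters `gate_w` and `gate_b`, and the use of history features as both keys and values are all assumptions made for illustration; the position guidance, Future-augmented History, and the background suppression objective are left out.

```python
import math


def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def softmax(xs):
    """Softmax over a list of floats, shifted by the max for stability."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def gated_cross_attention(query, history, gate_w, gate_b=0.0):
    """Attend from the current-frame query over history frames.

    query:   list of d floats (current-frame feature).
    history: list of d-dimensional frame features, used here as both
             keys and values (an assumption of this sketch).
    gate_w, gate_b: hypothetical linear gate parameters; each frame's
             gate is sigmoid(gate_w . frame + gate_b), in (0, 1).
    Returns (output feature, attention weights).
    """
    d = len(query)
    logits = []
    for frame in history:
        score = dot(query, frame) / math.sqrt(d)        # scaled dot-product score
        log_gate = log_sigmoid(dot(gate_w, frame) + gate_b)
        logits.append(score + log_gate)                 # gate near 0 -> large penalty
    weights = softmax(logits)
    output = [sum(w * frame[i] for w, frame in zip(weights, history))
              for i in range(d)]
    return output, weights


# Toy example: the second history frame matches the query as well as the
# first, but the gate (driven by its second feature) suppresses it.
query = [1.0, 0.0]
history = [[1.0, 0.0], [1.0, 5.0], [0.0, 1.0]]
_, plain_weights = gated_cross_attention(query, history, gate_w=[0.0, 0.0])
_, gated_weights = gated_cross_attention(query, history, gate_w=[0.0, -2.0])
print(plain_weights)
print(gated_weights)
```

With a zero gate vector every frame gets the same gate, which shifts all logits equally and leaves ordinary attention unchanged; with a non-zero gate vector the attention mass moves away from the frames the gate marks as uninformative.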
