建立最终至最终空间-临时行动探测器的最低努力 (Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector) - 专知论文

会员服务 ·

0

极小点 · 端到端 · Boosting（一种模型训练加速方式） · MoDELS · Performer ·

2022 年 6 月 7 日

Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector

翻译：建立最终至最终空间-临时行动探测器的最低努力

Lin Sui,Chen-Lin Zhang,Lixin Gu,Feng Han

Spatial-temporal action detection is a vital part of video understanding. Current spatial-temporal action detection methods will first use an object detector to obtain person candidate proposals. Then, the model will classify the person candidates into different action categories. So-called two-stage methods are heavy and hard to apply in real-world applications. Some existing methods use a unified model structure, But they perform badly with the vanilla model and often need extra modules to boost the performance. In this paper, we explore the strategy to build an end-to-end spatial-temporal action detector with minimal modifications. To this end, we propose a new method named ME-STAD, which solves the spatial-temporal action detection problem in an end-to-end manner. Besides the model design, we propose a novel labeling strategy to deal with sparse annotations in spatial-temporal datasets. The proposed ME-STAD achieves better results (2.2% mAP boost) than original two-stage detectors and around 80% FLOPs reduction. Moreover, our proposed ME-STAD only has minimum modifications with previous methods and does not require extra components. Our code will be made public.

翻译：空间时空动作探测是视频理解的一个重要部分。目前的空间时空动作探测方法将首先使用物体探测器来获取个人候选建议。然后, 该模型将个人候选人分为不同的行动类别。所谓的两阶段方法在现实应用中是沉重的, 很难应用。一些现有的方法使用统一的模型结构, 但是它们与香草模型不相符, 通常需要额外的模块来提升性能。在本文中, 我们探索建立一个终端到终端空间时空动作探测器的战略, 且只有最小的修改。为此, 我们提出了一个新的方法, 名为 ME- STAD, 以端到端的方式解决空间时空动作探测问题。除了模型设计外, 我们提出了一个新的标签战略, 处理空间时空数据集中稀少的描述。提议的ME-STAD 取得了比最初的两阶段探测器更好的结果( 2.2% mAP 推进), 以及大约80% FLOPs 。此外, 我们提议的ME-STAD 将仅对以往的方法进行最低限度的修改, 不需要额外的组件。

0

相关内容

极小点

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

蛋白磷酸酶2A在NO供体诱导肝癌细胞凋亡中的调节作用

国家自然科学基金

0+阅读 · 2015年12月31日

西沙群岛岛质效应的时空特征分析及动力机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

zkscan3基因新功能的解析

国家自然科学基金

0+阅读 · 2014年12月31日

AMPK-Beclin-1/Vps34通路在维生素D3（Vit D)诱导足细胞自噬中的作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

溶酶体组织蛋白酶B参与细胞凋亡机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

Siva蛋白在口蹄疫病毒VP2基因诱导细胞凋亡中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

稳态强磁场下细胞凋亡的多基因调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

蓝藻prx基因家族成员功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

靶向干预NF-кB信号通路防治动脉粥样硬化的作用及机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于光子晶体慢光效应的硅薄膜太阳能电池光电转换机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Robust 3D Object Detection in Cold Weather Conditions

Arxiv

0+阅读 · 2022年7月25日

An Exploration of How Training Set Composition Bias in Machine Learning Affects Identifying Rare Objects

Arxiv

0+阅读 · 2022年7月25日

End-to-End Active Speaker Detection

Arxiv

0+阅读 · 2022年7月25日

Focused Decoding Enables 3D Anatomical Detection by Transformers

Arxiv

0+阅读 · 2022年7月21日

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

Arxiv

0+阅读 · 2022年7月21日

An Efficient Spatio-Temporal Pyramid Transformer for Action Detection

Arxiv

0+阅读 · 2022年7月21日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

Towards Open World Object Detection

Arxiv

13+阅读 · 2021年3月3日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Arxiv

11+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

Boosting（一种模型训练加速方式）

相关VIP内容

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Robust 3D Object Detection in Cold Weather Conditions

Arxiv

0+阅读 · 2022年7月25日

An Exploration of How Training Set Composition Bias in Machine Learning Affects Identifying Rare Objects

Arxiv

0+阅读 · 2022年7月25日

End-to-End Active Speaker Detection

Arxiv

0+阅读 · 2022年7月25日

Focused Decoding Enables 3D Anatomical Detection by Transformers

Arxiv

0+阅读 · 2022年7月21日

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

Arxiv

0+阅读 · 2022年7月21日

An Efficient Spatio-Temporal Pyramid Transformer for Action Detection

Arxiv

0+阅读 · 2022年7月21日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

Towards Open World Object Detection

Arxiv

13+阅读 · 2021年3月3日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Arxiv

11+阅读 · 2018年3月28日

相关基金

蛋白磷酸酶2A在NO供体诱导肝癌细胞凋亡中的调节作用

国家自然科学基金

0+阅读 · 2015年12月31日

西沙群岛岛质效应的时空特征分析及动力机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

zkscan3基因新功能的解析

国家自然科学基金

0+阅读 · 2014年12月31日

AMPK-Beclin-1/Vps34通路在维生素D3（Vit D)诱导足细胞自噬中的作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

溶酶体组织蛋白酶B参与细胞凋亡机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

Siva蛋白在口蹄疫病毒VP2基因诱导细胞凋亡中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

稳态强磁场下细胞凋亡的多基因调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

蓝藻prx基因家族成员功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

靶向干预NF-кB信号通路防治动脉粥样硬化的作用及机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于光子晶体慢光效应的硅薄膜太阳能电池光电转换机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员