关于基于骨干行动确认的基于自我注意的锚定提案 (Self-attention based anchor proposal for skeleton-based action recognition) - 专知论文

会员服务 ·

0

图卷积网络 · anchor · 图卷积神经网络/图卷积网络 · MoDELS · Obvious ·

2021 年 12 月 17 日

Self-attention based anchor proposal for skeleton-based action recognition

翻译：关于基于骨干行动确认的基于自我注意的锚定提案

Ruijie Hou,Zhao Wang

Skeleton sequences are widely used for action recognition task due to its lightweight and compact characteristics. Recent graph convolutional network (GCN) approaches have achieved great success for skeleton-based action recognition since its grateful modeling ability of non-Euclidean data. GCN is able to utilize the short-range joint dependencies while lack to directly model the distant joints relations that are vital to distinguishing various actions. Thus, many GCN approaches try to employ hierarchical mechanism to aggregate wider-range neighborhood information. We propose a novel self-attention based skeleton-anchor proposal (SAP) module to comprehensively model the internal relations of a human body for motion feature learning. The proposed SAP module aims to explore inherent relationship within human body using a triplet representation via encoding high order angle information rather than the fixed pair-wise bone connection used in the existing hierarchical GCN approaches. A Self-attention based anchor selection method is designed in the proposed SAP module for extracting the root point of encoding angular information. By coupling proposed SAP module with popular spatial-temporal graph neural networks, e.g. MSG3D, it achieves new state-of-the-art accuracy on challenging benchmark datasets. Further ablation study have shown the effectiveness of our proposed SAP module, which is able to obviously improve the performance of many popular skeleton-based action recognition methods.

翻译：近期的图形革命网络(GCN)方法在以骨骼为基础的行动识别方面取得了巨大成功,因为它对非欧洲语言数据具有感恩的建模能力。GCN能够利用短距离联合依赖关系,但又不能直接模拟对区分各种行动至关重要的远程联结关系。因此,许多GCN方法试图利用等级机制来汇总更广泛的周边信息。我们提议了一个基于骨架-锁板建议(SAP)新颖的基于自我注意的模块,以全面模拟人类身体的内部关系,以便进行运动特征学习。SAP模块的目的是通过三重制高调角度信息,而不是现有等级GCN方法中使用的固定双向骨连接,探索人体内部的内在关系。在拟议的SAP模块中设计了一种基于自我关注的锚定选择方法,以提取宽度信息的根点。我们提议的SAP模块将基于广受欢迎的基于空间-时空图形网络的SAP模块(SAP-SAP)组合成模块,例如MSG3D3D,该模块显然具有挑战性地改进了SAP标准化模式,从而改进了我们提出的许多标准化模型。

0

相关内容

图卷积网络

图卷积网络

图卷积网络（简称GCN），由Thomas Kpif于2017年在论文Semi-supervised classification with graph convolutional networks中提出。它为图（graph）结构数据的处理提供了一个崭新的思路，将深度学习中常用于图像的卷积神经网络应用到图数据上。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

【PKDD 2021】PaGNN：基于交互结构学习的链路预测

【PKDD 2021】PaGNN：基于交互结构学习的链路预测

专知会员服务

18+阅读 · 2021年11月26日

【硬核书】Linux核心编程|Linux Kernel Programming，741页pdf

【硬核书】Linux核心编程|Linux Kernel Programming，741页pdf

专知会员服务

80+阅读 · 2021年3月26日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【硬核书】金融数学C++编程，411页pdf，C++ for Financial Mathematics

【硬核书】金融数学C++编程，411页pdf，C++ for Financial Mathematics

专知会员服务

75+阅读 · 2020年4月6日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

【WSDM 2020 论文】基于自关注网络的动态图表示学习（Dynamic graph representation learning via self-attention networks），Visa Research的研究员武延宏等

【WSDM 2020 论文】基于自关注网络的动态图表示学习（Dynamic graph representation learning via self-attention networks），Visa Research的研究员武延宏等

专知会员服务

98+阅读 · 2019年11月20日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

从三大顶会论文看百变Self-Attention

从三大顶会论文看百变Self-Attention

PaperWeekly

17+阅读 · 2019年11月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

简评 | Video Action Recognition 的近期进展

简评 | Video Action Recognition 的近期进展

极市平台

20+阅读 · 2019年4月21日

商研丨实例分割的进阶三级跳：从Mask R-CNN到Hybrid Task Cascade

商研丨实例分割的进阶三级跳：从Mask R-CNN到Hybrid Task Cascade

商汤科技

3+阅读 · 2019年3月21日

【干货】实例分割的进阶三级跳：从 Mask R-CNN 到 Hybrid Task Cascade

【干货】实例分割的进阶三级跳：从 Mask R-CNN 到 Hybrid Task Cascade

GAN生成式对抗网络

8+阅读 · 2019年3月14日

【泡泡一分钟】基于时间滑动LSTM网络的基于骨架动作识别（ICCV2017-106）

【泡泡一分钟】基于时间滑动LSTM网络的基于骨架动作识别（ICCV2017-106）

泡泡机器人SLAM

5+阅读 · 2018年9月27日

【泡泡一分钟】基于注意力机制的深度网络HydraPlus-Net(ICCV2017-34)

【泡泡一分钟】基于注意力机制的深度网络HydraPlus-Net(ICCV2017-34)

泡泡机器人SLAM

8+阅读 · 2018年6月9日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

Mask R-CNN 论文笔记

Mask R-CNN 论文笔记

统计学习与视觉计算组

11+阅读 · 2018年3月22日

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

机器学习研究会

6+阅读 · 2017年8月5日

Facial Expression Recognition with Visual Transformers and Attentional Selective Fusion

Arxiv

0+阅读 · 2022年2月22日

Graph Spring Network and Informative Anchor Selection for Session-based Recommendation

Arxiv

0+阅读 · 2022年2月19日

Conditional Local Convolution for Spatio-temporal Meteorological Forecasting

Arxiv

10+阅读 · 2021年12月2日

OadTR: Online Action Detection with Transformers

Arxiv

7+阅读 · 2021年6月21日

Graph Convolutional Networks for Temporal Action Localization

Arxiv

5+阅读 · 2019年9月7日

Scene-based Factored Attention for Image Captioning

Arxiv

4+阅读 · 2019年8月7日

An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition

Arxiv

9+阅读 · 2019年3月29日

Progressive Sparse Local Attention for Video object detection

Arxiv

4+阅读 · 2019年3月21日

Learning Representative Temporal Features for Action Recognition

Arxiv

4+阅读 · 2018年3月14日

A Unified Method for First and Third Person Action Recognition

Arxiv

3+阅读 · 2017年12月30日

VIP会员

文章信息

相关主题

图卷积网络

图卷积神经网络/图卷积网络

相关VIP内容

【PKDD 2021】PaGNN：基于交互结构学习的链路预测

【PKDD 2021】PaGNN：基于交互结构学习的链路预测

专知会员服务

18+阅读 · 2021年11月26日

【硬核书】Linux核心编程|Linux Kernel Programming，741页pdf

【硬核书】Linux核心编程|Linux Kernel Programming，741页pdf

专知会员服务

80+阅读 · 2021年3月26日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【硬核书】金融数学C++编程，411页pdf，C++ for Financial Mathematics

【硬核书】金融数学C++编程，411页pdf，C++ for Financial Mathematics

专知会员服务

75+阅读 · 2020年4月6日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

【WSDM 2020 论文】基于自关注网络的动态图表示学习（Dynamic graph representation learning via self-attention networks），Visa Research的研究员武延宏等

【WSDM 2020 论文】基于自关注网络的动态图表示学习（Dynamic graph representation learning via self-attention networks），Visa Research的研究员武延宏等

专知会员服务

98+阅读 · 2019年11月20日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维与高维空间中对潜在表征的分析、建模与变换

《美军使用大语言模型技术生成领域特定文档》2025最新379页

【NeurIPS 2025】以语言为中心的全模态表征学习的可扩展性研究

智能体化多模态大语言模型综述

相关资讯

从三大顶会论文看百变Self-Attention

从三大顶会论文看百变Self-Attention

PaperWeekly

17+阅读 · 2019年11月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

简评 | Video Action Recognition 的近期进展

简评 | Video Action Recognition 的近期进展

极市平台

20+阅读 · 2019年4月21日

商研丨实例分割的进阶三级跳：从Mask R-CNN到Hybrid Task Cascade

商研丨实例分割的进阶三级跳：从Mask R-CNN到Hybrid Task Cascade

商汤科技

3+阅读 · 2019年3月21日

【干货】实例分割的进阶三级跳：从 Mask R-CNN 到 Hybrid Task Cascade

【干货】实例分割的进阶三级跳：从 Mask R-CNN 到 Hybrid Task Cascade

GAN生成式对抗网络

8+阅读 · 2019年3月14日

【泡泡一分钟】基于时间滑动LSTM网络的基于骨架动作识别（ICCV2017-106）

【泡泡一分钟】基于时间滑动LSTM网络的基于骨架动作识别（ICCV2017-106）

泡泡机器人SLAM

5+阅读 · 2018年9月27日

【泡泡一分钟】基于注意力机制的深度网络HydraPlus-Net(ICCV2017-34)

【泡泡一分钟】基于注意力机制的深度网络HydraPlus-Net(ICCV2017-34)

泡泡机器人SLAM

8+阅读 · 2018年6月9日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

Mask R-CNN 论文笔记

Mask R-CNN 论文笔记

统计学习与视觉计算组

11+阅读 · 2018年3月22日

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

机器学习研究会

6+阅读 · 2017年8月5日

相关论文

Facial Expression Recognition with Visual Transformers and Attentional Selective Fusion

Arxiv

0+阅读 · 2022年2月22日

Graph Spring Network and Informative Anchor Selection for Session-based Recommendation

Arxiv

0+阅读 · 2022年2月19日

Conditional Local Convolution for Spatio-temporal Meteorological Forecasting

Arxiv

10+阅读 · 2021年12月2日

OadTR: Online Action Detection with Transformers

Arxiv

7+阅读 · 2021年6月21日

Graph Convolutional Networks for Temporal Action Localization

Arxiv

5+阅读 · 2019年9月7日

Scene-based Factored Attention for Image Captioning

Arxiv

4+阅读 · 2019年8月7日

An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition

Arxiv

9+阅读 · 2019年3月29日

Progressive Sparse Local Attention for Video object detection

Arxiv

4+阅读 · 2019年3月21日

Learning Representative Temporal Features for Action Recognition

Arxiv

4+阅读 · 2018年3月14日

A Unified Method for First and Third Person Action Recognition

Arxiv

3+阅读 · 2017年12月30日

微信扫码咨询专知VIP会员