Skeleton sequences are widely used for action recognition task due to its lightweight and compact characteristics. Recent graph convolutional network (GCN) approaches have achieved great success for skeleton-based action recognition since its grateful modeling ability of non-Euclidean data. GCN is able to utilize the short-range joint dependencies while lack to directly model the distant joints relations that are vital to distinguishing various actions. Thus, many GCN approaches try to employ hierarchical mechanism to aggregate wider-range neighborhood information. We propose a novel self-attention based skeleton-anchor proposal (SAP) module to comprehensively model the internal relations of a human body for motion feature learning. The proposed SAP module aims to explore inherent relationship within human body using a triplet representation via encoding high order angle information rather than the fixed pair-wise bone connection used in the existing hierarchical GCN approaches. A Self-attention based anchor selection method is designed in the proposed SAP module for extracting the root point of encoding angular information. By coupling proposed SAP module with popular spatial-temporal graph neural networks, e.g. MSG3D, it achieves new state-of-the-art accuracy on challenging benchmark datasets. Further ablation study have shown the effectiveness of our proposed SAP module, which is able to obviously improve the performance of many popular skeleton-based action recognition methods.
翻译:近期的图形革命网络(GCN)方法在以骨骼为基础的行动识别方面取得了巨大成功,因为它对非欧洲语言数据具有感恩的建模能力。GCN能够利用短距离联合依赖关系,但又不能直接模拟对区分各种行动至关重要的远程联结关系。因此,许多GCN方法试图利用等级机制来汇总更广泛的周边信息。我们提议了一个基于骨架-锁板建议(SAP)新颖的基于自我注意的模块,以全面模拟人类身体的内部关系,以便进行运动特征学习。SAP模块的目的是通过三重制高调角度信息,而不是现有等级GCN方法中使用的固定双向骨连接,探索人体内部的内在关系。在拟议的SAP模块中设计了一种基于自我关注的锚定选择方法,以提取宽度信息的根点。我们提议的SAP模块将基于广受欢迎的基于空间-时空图形网络的SAP模块(SAP-SAP)组合成模块,例如MSG3D3D,该模块显然具有挑战性地改进了SAP标准化模式,从而改进了我们提出的许多标准化模型。