Existing methods for skeleton-based action recognition mainly focus on improving the recognition accuracy, whereas the efficiency of the model is rarely considered. Recently, there are some works trying to speed up the skeleton modeling by designing light-weight modules. However, in addition to the model size, the amount of the data involved in the calculation is also an important factor for the running speed, especially for the skeleton data where most of the joints are redundant or non-informative to identify a specific skeleton. Besides, previous works usually employ one fix-sized model for all the samples regardless of the difficulty of recognition, which wastes computations for easy samples. To address these limitations, a novel approach, called AdaSGN, is proposed in this paper, which can reduce the computational cost of the inference process by adaptively controlling the input number of the joints of the skeleton on-the-fly. Moreover, it can also adaptively select the optimal model size for each sample to achieve a better trade-off between accuracy and efficiency. We conduct extensive experiments on three challenging datasets, namely, NTU-60, NTU-120 and SHREC, to verify the superiority of the proposed approach, where AdaSGN achieves comparable or even higher performance with much lower GFLOPs compared with the baseline method.
翻译:现有基于骨骼的行动识别方法主要侧重于提高识别准确性,而模型的效率却很少考虑。最近,有些工作试图通过设计轻量模块来加快骨架模型,但除了模型大小外,计算中涉及的数据数量也是运行速度的一个重要因素,特别是骨骼数据,因为大多数连接都是冗余或非信息化的,可以确定具体的骨架。此外,以往的工作通常对所有样本采用一个固定大小的模型,而不论识别困难如何,所有样本都采用一个固定大小的模式,而这种模型的计算是用于简单样品的废物。为了克服这些局限性,本文件提出了一种称为AdaSGN的新办法,这种办法可以通过调整控制机骨骼联合的输入数来降低推断过程的计算成本。此外,它也可以为每个样本选择最佳模型规模,以便在准确性和效率之间实现更好的交易。我们对所有具有挑战性的数据集,即NTU-60、NTU-120和SHREC进行了广泛的实验。为了应对这些局限性的数据集,即ADS-GNS-120和SHRECE,即所谓的AD-GN,以高得多的基质比GM方法达到高得多的优势,从而比GNGNS。