Skeleton-based action recognition models have recently been shown to be vulnerable to adversarial attacks. Compared to adversarial attacks on images, perturbations to skeletons are typically bounded to a lower dimension of approximately 100 per frame. This lower-dimensional setting makes it more difficult to generate imperceptible perturbations. Existing attacks resolve this by exploiting the temporal structure of the skeleton motion so that the perturbation dimension increases to thousands. In this paper, we show that adversarial attacks can be performed on skeleton-based action recognition models, even in a significantly low-dimensional setting without any temporal manipulation. Specifically, we restrict the perturbations to the lengths of the skeleton's bones, which allows an adversary to manipulate only approximately 30 effective dimensions. We conducted experiments on the NTU RGB+D and HDM05 datasets and demonstrate that the proposed attack successfully deceived models with sometimes greater than 90\% success rate by small perturbations. Furthermore, we discovered an interesting phenomenon: in our low-dimensional setting, the adversarial training with the bone length attack shares a similar property with data augmentation, and it not only improves the adversarial robustness but also improves the classification accuracy on the original original data. This is an interesting counterexample of the trade-off between adversarial robustness and clean accuracy, which has been widely observed in studies on adversarial training in the high-dimensional regime.
翻译:以皮肤为基础的行动识别模型最近被证明很容易受到对抗性攻击。 与对图像的对抗性攻击相比,对骨骼的扰动通常与每框架约100个低维相联。 这种低维设置使得更难产生不易察觉的扰动。 现有的攻击通过利用骨骼运动的时间结构,使扰动的维度增加至数千个,从而解决这个问题。 在本文中,我们显示,对立性攻击可以在基于骨骼的行动识别模型上进行,甚至在没有时间操纵的显著低维度环境中进行。 具体地说,我们把对骨骼的扰动限制在骨骼的长度上,使得对手只能操纵大约30个有效维。我们在NTU RGB+D和HDMD05数据集上进行了实验,并表明拟议的攻击成功地欺骗了模型,有时通过小扰动率超过90 ⁇ 的成功率。 此外,我们发现了一个有趣的现象:在我们的低维度环境中,对骨骼攻击的对立性训练与对立性攻击有着相似的属性,与原始数据增强的特性是相同的特性,而它不仅使对手能够操纵的对立性的研究更加精确,而且在对立性的研究中也提高了对立性分析。