Video classification systems are vulnerable to adversarial attacks, which can create severe security problems in video verification. Current black-box attacks need a large number of queries to succeed, resulting in high computational overhead in the process of attack. On the other hand, attacks with restricted perturbations are ineffective against defenses such as denoising or adversarial training. In this paper, we focus on unrestricted perturbations and propose StyleFool, a black-box video adversarial attack via style transfer to fool the video classification system. StyleFool first utilizes color theme proximity to select the best style image, which helps avoid unnatural details in the stylized videos. Meanwhile, the target class confidence is additionally considered in targeted attacks to influence the output distribution of the classifier by moving the stylized video closer to or even across the decision boundary. A gradient-free method is then employed to further optimize the adversarial perturbations. We carry out extensive experiments to evaluate StyleFool on two standard datasets, UCF-101 and HMDB-51. The experimental results demonstrate that StyleFool outperforms the state-of-the-art adversarial attacks in terms of both the number of queries and the robustness against existing defenses. Moreover, 50% of the stylized videos in untargeted attacks do not need any query since they can already fool the video classification model. Furthermore, we evaluate the indistinguishability through a user study to show that the adversarial samples of StyleFool look imperceptible to human eyes, despite unrestricted perturbations.
翻译:视频分类系统容易受到对抗性攻击,这可能会在视频验证中造成严重的安全问题。 当前的黑盒攻击需要大量查询才能成功, 从而导致攻击过程中的高计算成本。 另一方面, 限制干扰攻击对防御性攻击是无效的, 比如降音或对抗性训练。 在本文中, 我们侧重于无限制的扰动, 并提议StyleFool, 这是一种黑盒对抗性攻击性攻击, 通过风格传输来愚弄视频分类系统。 StyfFool 首先利用颜色主题接近来选择最佳风格图像, 这有助于避免超自然的细节。 同时, 目标阶级信心在定向攻击中被进一步考虑, 以影响分类器的输出分布, 从而影响分解性视频在决定界限上更近甚至跨过。 然后, 我们使用一种无梯度的方法来进一步优化对抗性视频扰动性攻击。 我们在两个标准数据集( UCF-101 和 HMDB- 51) 上用彩色眼睛来评价Styfol 。 实验结果显示, StreyFol 已经超越了50级的硬性攻击性攻击性攻击的准确性, 。