Capsule network is a type of neural network that uses the spatial relationship between features to classify images. By capturing the poses and relative positions between features, its ability to recognize affine transformation is improved, and it surpasses traditional convolutional neural networks (CNNs) when handling translation, rotation and scaling. The Stacked Capsule Autoencoder (SCAE) is the state-of-the-art capsule network. The SCAE encodes an image as capsules, each of which contains poses of features and their correlations. The encoded contents are then input into the downstream classifier to predict the categories of the images. Existing research mainly focuses on the security of capsule networks with dynamic routing or EM routing, and little attention has been given to the security and robustness of the SCAE. In this paper, we propose an evasion attack against the SCAE. After a perturbation is generated with an optimization algorithm, it is added to an image to reduce the output of the capsules related to the original category of the image. As the contribution of these capsules to the original class is reduced, the perturbed image will be misclassified. We evaluate the attack using an image classification experiment, and the experimental results indicate that our attack can achieve an approximately 99% success rate. We hope that our work will make the community aware of the threat of this attack and raise the attention given to SCAE security.
翻译:Capsule 网络是一种神经网络,它使用各种功能之间的空间关系来对图像进行分类。 通过捕捉各功能之间的配置和相对位置, 其辨别松动变异的能力得到提高, 并且超过处理翻译、 旋转和缩放的传统神经网络(CNNs) 。 Stacked Capsule Autoencoder (SCAE) 是状态先进的胶囊网络。 SCAE 将一个图像编码成胶囊, 其中每个胶囊都包含特征和相关性。 编码的内容随后被输入下游分类器, 以预测图像的类别。 现有的研究主要侧重于带有动态路由或EM路由的胶囊网络的安全性, 并且很少关注 SCAE 的安全和稳健性。 在本文中, 我们建议对 SCAE 进行规避攻击。 在以最优化算法生成了扰动后, 添加了一个图像来减少胶囊的输出。 与图像原始类别有关的输出。 由于这些胶囊对原始分类的贡献, 现有胶囊对带有动态路由或EM路由, 我们的实验级的实验率将降低了我们攻击率的图像的图像。 我们的实验率将降低了攻击率 。 我们的图像将降低了我们攻击率 。