In this study, we introduce a measure for machine perception, inspired by the concept of Just Noticeable Difference (JND) of human perception. Based on this measure, we suggest an adversarial image generation algorithm, which iteratively distorts an image by an additive noise until the model detects the change in the image by outputting a false label. The noise added to the original image is defined as the gradient of the cost function of the model. A novel cost function is defined to explicitly minimize the amount of perturbation applied to the input image while enforcing the perceptual similarity between the adversarial and input images. For this purpose, the cost function is regularized by the well-known total variation and bounded range terms to meet the natural appearance of the adversarial image. We evaluate the adversarial images generated by our algorithm both qualitatively and quantitatively on CIFAR10, ImageNet, and MS COCO datasets. Our experiments on image classification and object detection tasks show that adversarial images generated by our JND method are both more successful in deceiving the recognition/detection models and less perturbed compared to the images generated by the state-of-the-art methods, namely, FGV, FSGM, and DeepFool methods.
翻译:在本研究中,我们引入了一种机器感知措施,其灵感来自对人体感知的“可察觉差异”(JND)概念。根据这一措施,我们建议采用对抗性图像生成算法,在模型通过输出假标签检测图像变化之前,用添加噪音反复扭曲图像,直到模型通过输出假标签发现图像变化。原始图像中添加的噪音被定义为模型成本函数的梯度。新颖的成本功能被定义为明确将输入图像的扰动量最小化,同时实施对称图像和输入图像之间的概念相似性。为此,成本函数由众所周知的总体变异和约束范围术语规范化,以满足对称图像的自然外观。我们从质量和数量上评价我们的算法在CIRFAR10、图像网和MS COCO数据集上产生的对抗图像。我们在图像分类和对象检测任务方面的实验表明,我们的JND方法产生的对抗性图像在认识/探测模型和输入图像方面比较成功,而且与州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州-州