A significant amount of work has been done on adversarial attacks that inject imperceptible noise to images to deteriorate the image classification performance of deep models. However, most of the existing studies consider attacks in the digital (pixel) domain where an image acquired by an image sensor with sampling and quantization has been recorded. This paper, for the first time, introduces an optical adversarial attack, which physically alters the light field information arriving at the image sensor so that the classification model yields misclassification. More specifically, we modulate the phase of the light in the Fourier domain using a spatial light modulator placed in the photographic system. The operative parameters of the modulator are obtained by gradient-based optimization to maximize cross-entropy and minimize distortions. We present experiments based on both simulation and a real hardware optical system, from which the feasibility of the proposed optical attack is demonstrated. It is also verified that the proposed attack is completely different from common optical-domain distortions such as spherical aberration, defocus, and astigmatism in terms of both perturbation patterns and classification results.
翻译:在对抗性攻击方面已经做了大量的工作,这种攻击对图像注入了无法察觉的噪音,使深层模型的图像分类性能恶化。然而,大多数现有研究都考虑到数字(像素)领域的攻击,因为通过取样和量化录制的图像传感器获得的图像已经记录下来。本文首次引入了光学对抗性攻击,实际改变了到达图像传感器的光场信息,使分类模型产生错误的分类。更具体地说,我们使用在摄影系统中放置的空间光模调器来调节Fourier域的光相阶段。通过基于梯度的优化来获取调制器的操作参数,以尽量扩大交叉湿度和尽量减少扭曲。我们介绍了基于模拟和真正的硬件光学系统的实验,从中可以证明拟议的光攻击的可行性。还核实,拟议的攻击与一般光学-表面扭曲完全不同,例如球形畸变、脱焦、以及从孔径和分类结果两方面来说都是光学的。