通过投影梯度估计法进行渐进式边界边界黑盒攻击 (Progressive-Scale Boundary Blackbox Attack via Projective Gradient Estimation)

Boundary based blackbox attack has been recognized as practical and effective, given that an attacker only needs to access the final model prediction. However, the query efficiency of it is in general high especially for high dimensional image data. In this paper, we show that such efficiency highly depends on the scale at which the attack is applied, and attacking at the optimal scale significantly improves the efficiency. In particular, we propose a theoretical framework to analyze and show three key characteristics to improve the query efficiency. We prove that there exists an optimal scale for projective gradient estimation. Our framework also explains the satisfactory performance achieved by existing boundary black-box attacks. Based on our theoretical framework, we propose Progressive-Scale enabled projective Boundary Attack (PSBA) to improve the query efficiency via progressive scaling techniques. In particular, we employ Progressive-GAN to optimize the scale of projections, which we call PSBA-PGAN. We evaluate our approach on both spatial and frequency scales. Extensive experiments on MNIST, CIFAR-10, CelebA, and ImageNet against different models including a real-world face recognition API show that PSBA-PGAN significantly outperforms existing baseline attacks in terms of query efficiency and attack success rate. We also observe relatively stable optimal scales for different models and datasets. The code is publicly available at https://github.com/AI-secure/PSBA.

翻译：基于边界的黑匣子攻击被公认为实际而有效,因为攻击者只需获得最后模型预测即可获得最后模型预测;然而,其查询效率一般而言很高,特别是高维图像数据。我们在本文件中表明,这种效率在很大程度上取决于攻击的施用规模,并以最佳规模进行攻击可大大提高效率。我们特别提议了一个理论框架来分析和显示提高查询效率的三个主要特征。我们证明,投影梯度估计有最佳规模。我们的框架还解释了现有边界黑盒攻击的满意性能。我们根据我们的理论框架,提议采用渐进规模增强的投影性边界攻击(PSA)来通过逐步扩大技术提高查询效率。特别是,我们利用进步GAN来优化预测规模,我们称之为PSA-PAN。我们从空间和频率两个角度评价我们的方法。我们对MNIST、CIFAR-10、CelibA和图像网络的不同模型,包括真实世界的确认,表明PSA-PPAN大大超出现有预测性边界攻击(PSA),我们用渐进式的基线攻击率/比较标准。我们用SBA/SABSBSA标准进行最稳定地衡量。