Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks

Adversarial attacks based on randomized search schemes have recently obtained state-of-the-art results in black-box robustness evaluation. However, as we demonstrate in this work, their efficiency in different query budget regimes depends on manual design and heuristic tuning of the underlying proposal distributions. We study how this issue can be addressed by adapting the proposal distribution online based on the information obtained during the attack. We consider Square Attack, a state-of-the-art score-based black-box attack, and demonstrate how its performance can be improved by a learned controller that adjusts the parameters of the proposal distribution online during the attack. We train the controller using gradient-based end-to-end training on a CIFAR10 model with white-box access. We demonstrate that plugging the learned controller into the attack consistently improves its black-box robustness estimate in different query regimes by up to 20% for a wide range of different models with black-box access. We further show that the learned adaptation principle transfers well to other data distributions such as CIFAR100 or ImageNet and to the targeted attack setting.
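To make the idea of adapting the proposal distribution online more concrete, the following is a minimal sketch of a score-based random search attack with square-shaped proposals. It is not the authors' implementation: the toy linear classifier, the function names (`margin_loss`, `rule_based_controller`, `random_search_attack`), and all constants are assumptions for illustration. In particular, the controller here is a hand-written success-based rule standing in for the learned controller described in the abstract.

```python
import numpy as np

def margin_loss(model, x, label):
    """Score-based feedback: true-class logit minus the best other logit.
    A negative value means the input is misclassified."""
    logits = model(x)
    return logits[label] - np.delete(logits, label).max()

def rule_based_controller(p, improved, p_min=0.01, p_max=0.5):
    """Hypothetical stand-in for a learned controller: grow the square-size
    parameter after a successful proposal, shrink it after a failure."""
    p = p * 1.1 if improved else p * 0.98
    return float(np.clip(p, p_min, p_max))

def random_search_attack(model, x, label, eps=0.05, n_queries=500,
                         p_init=0.3, seed=0):
    """L-infinity random search with square-shaped proposals (a sketch).
    p is the fraction of pixels changed per proposal."""
    rng = np.random.default_rng(seed)
    h, w = x.shape
    # Start from vertical stripes of +/- eps (one random sign per column).
    stripes = eps * rng.choice([-1.0, 1.0], size=(1, w)).repeat(h, axis=0)
    x_adv = np.clip(x + stripes, 0.0, 1.0)
    best = margin_loss(model, x_adv, label)
    history = [best]
    p = p_init
    for _ in range(n_queries):
        if best < 0:  # already misclassified, stop querying
            break
        side = int(round(np.sqrt(p * h * w)))
        side = min(max(side, 1), h, w)
        r = rng.integers(0, h - side + 1)
        c = rng.integers(0, w - side + 1)
        # Propose: overwrite one square with a fresh +/- eps perturbation.
        cand = x_adv.copy()
        sign = rng.choice([-1.0, 1.0])
        cand[r:r + side, c:c + side] = np.clip(
            x[r:r + side, c:c + side] + eps * sign, 0.0, 1.0)
        loss = margin_loss(model, cand, label)
        improved = loss < best
        if improved:  # keep the proposal only if the loss decreased
            x_adv, best = cand, loss
        # The controller adapts the proposal distribution from attack feedback.
        p = rule_based_controller(p, improved)
        history.append(best)
    return x_adv, history

# Toy black box (an assumption): a fixed random linear classifier on 16x16 inputs.
demo_rng = np.random.default_rng(1)
W = demo_rng.normal(size=(10, 16 * 16))
model = lambda img: W @ img.ravel()
x = demo_rng.uniform(0.0, 1.0, size=(16, 16))
label = int(np.argmax(model(x)))
eps = 0.05
x_adv, history = random_search_attack(model, x, label, eps=eps)
```

In the setting the abstract describes, the rule in `rule_based_controller` would be replaced by a controller trained end-to-end on a model with white-box access and then used unchanged against models with black-box access; the sketch only shows where such a controller plugs into the search loop.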


