模拟查询有效黑盒袭击的未知目标模型 (Simulating Unknown Target Models for Query-Efficient Black-box Attacks)

Many adversarial attacks have been proposed to investigate the security issues of deep neural networks. In the black-box setting, current model stealing attacks train a substitute model to counterfeit the functionality of the target model. However, the training requires querying the target model. Consequently, the query complexity remains high, and such attacks can be defended easily. This study aims to train a generalized substitute model called "Simulator", which can mimic the functionality of any unknown target model. To this end, we build the training data with the form of multiple tasks by collecting query sequences generated during the attacks of various existing networks. The learning process uses a mean square error-based knowledge-distillation loss in the meta-learning to minimize the difference between the Simulator and the sampled networks. The meta-gradients of this loss are then computed and accumulated from multiple tasks to update the Simulator and subsequently improve generalization. When attacking a target model that is unseen in training, the trained Simulator can accurately simulate its functionality using its limited feedback. As a result, a large fraction of queries can be transferred to the Simulator, thereby reducing query complexity. Results of the comprehensive experiments conducted using the CIFAR-10, CIFAR-100, and TinyImageNet datasets demonstrate that the proposed approach reduces query complexity by several orders of magnitude compared to the baseline method. The implementation source code is released at https://github.com/machanic/SimulatorAttack.

翻译：提议了许多对抗性攻击,以调查深神经网络的安全问题。在黑箱设置中,目前的偷窃式攻击模式用替代模型来模拟目标模型的功能。但是,培训需要询问目标模型。因此,查询的复杂性仍然很高,这种攻击可以很容易地进行辩护。这项研究的目的是训练一个名为“模拟器”的通用替代模型,该模型可以模仿任何未知目标模型的功能。为此,我们通过收集在各种现有网络袭击期间产生的查询序列,以多种任务的形式建立培训数据。学习过程在元学习中使用一个平均平方位基于错误的知识蒸馏模型,以尽量减少模拟器和抽样网络之间的差异。随后,从多项任务中计算和积累了这种损失的元变异性,以更新模拟器,并随后改进一般化。在攻击一个在培训中看不见的目标模型时,训练有素的模拟器可以使用有限的反馈来准确模拟其功能。因此,大量查询可以转移到模拟器,从而降低查询的复杂性,从而降低查询的精度。使用IMAR-10号进行的全面实验的结果是,使用拟议的CIAR-10号基准方法,通过拟议的数字序列,以显示IMAR-10号的进度方法,以显示全面的试验结果,以显示数字-10号