Despite our best efforts, deep learning models remain highly vulnerable to even tiny adversarial perturbations applied to their inputs. The ability to extract information solely from the output of a machine learning model in order to craft adversarial perturbations against black-box models is a practical threat to real-world systems, such as autonomous cars or machine learning models exposed as a service (MLaaS). Of particular interest are sparse attacks. The realization of sparse attacks against black-box models demonstrates that machine learning models are more vulnerable than we believe. These attacks aim to minimize the number of perturbed pixels, measured by the l_0 norm, required to mislead a model while observing only the decision (the predicted label) returned by a model query: the so-called decision-based attack setting. However, such an attack leads to an NP-hard optimization problem. We develop an evolution-based algorithm, SparseEvo, for this problem and evaluate it against both convolutional deep neural networks and vision transformers. Notably, vision transformers have not yet been investigated in the decision-based attack setting. SparseEvo requires significantly fewer model queries than the state-of-the-art sparse attack Pointwise for both untargeted and targeted attacks. Although conceptually simple, the attack algorithm is also competitive, under a limited query budget, with state-of-the-art gradient-based white-box attacks on standard computer vision tasks such as ImageNet. Importantly, the query-efficient SparseEvo, and decision-based attacks in general, raise new questions about the safety of deployed systems and pose new directions for studying and understanding the robustness of machine learning models.
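To make the decision-based sparse-attack setting concrete, the following is a minimal, mutation-only sketch of an evolutionary l_0 attack against a toy black-box classifier. It is not the paper's SparseEvo algorithm: the toy model, the mask-based encoding, and the mutation scheme are illustrative assumptions. The attacker observes only the predicted label, starts from an already-adversarial image, and evolves binary masks that revert as many perturbed pixels as possible to the source image while the decision remains flipped.

```python
import numpy as np

rng = np.random.default_rng(0)


def model_decision(x):
    # Toy black-box classifier: the label depends on mean intensity.
    # Only this label is observable (the decision-based setting).
    return int(x.mean() > 0.5)


def evo_sparse_attack(source, start_adv, model, generations=200, pop=8):
    """Hypothetical mutation-only sketch of an evolutionary l_0 attack.

    Each individual is a boolean mask over pixels: True pixels take the
    adversarial starting point's values, False pixels revert to the
    source image. Fitness is the l_0 perturbation size (mask.sum()),
    with infeasible masks (decision no longer flipped) penalized.
    """
    n = source.size
    src_label = model(source)

    def apply_mask(mask):
        x = source.copy().ravel()
        x[mask] = start_adv.ravel()[mask]
        return x.reshape(source.shape)

    def fitness(mask):
        # Masks that fail to change the model's decision cost infinity.
        return mask.sum() if model(apply_mask(mask)) != src_label else np.inf

    # Initialize with the fully perturbed (known adversarial) mask.
    population = [np.ones(n, dtype=bool) for _ in range(pop)]
    for _ in range(generations):
        children = []
        for m in population:
            # Mutate: randomly revert a few pixels to their source values.
            child = m.copy()
            idx = rng.choice(n, size=max(1, n // 20), replace=False)
            child[idx] = False
            children.append(child)
        # Select the feasible masks with the smallest l_0 perturbation.
        population = sorted(population + children, key=fitness)[:pop]

    best = population[0]
    return apply_mask(best), int(best.sum())
```

For example, with an all-zeros source and an all-ones adversarial start on an 8x8 image, the toy decision flips only while more than half the 64 pixels stay perturbed, so the search drives the perturbation down toward 33 pixels; the real problem is NP-hard, so the evolutionary search only approximates the sparsest perturbation.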