To explore the vulnerability of deep neural networks (DNNs), many attack paradigms have been well studied, such as the poisoning-based backdoor attack in the training stage and the adversarial attack in the inference stage. In this paper, we study a novel attack paradigm, which modifies model parameters in the deployment stage. Considering the effectiveness and stealthiness goals, we provide a general formulation to perform the bit-flip based weight attack, where the effectiveness term could be customized depending on the attacker's purpose. Furthermore, we present two cases of the general formulation with different malicious purposes, i.e., single sample attack (SSA) and triggered samples attack (TSA). To this end, we formulate this problem as a mixed integer programming (MIP) to jointly determine the state of the binary bits (0 or 1) in the memory and learn the sample modification. Utilizing the latest technique in integer programming, we equivalently reformulate this MIP problem as a continuous optimization problem, which can be effectively and efficiently solved using the alternating direction method of multipliers (ADMM) method. Consequently, the flipped critical bits can be easily determined through optimization, rather than using a heuristic strategy. Extensive experiments demonstrate the superiority of SSA and TSA in attacking DNNs.
翻译:为了探索深层神经网络的脆弱性,对许多攻击范式进行了认真的研究,例如训练阶段的以中毒为基础的后门攻击和推论阶段的对抗性攻击。在本文中,我们研究了一种新型攻击范式,它改变了部署阶段的模型参数。考虑到有效性和隐形目标,我们为进行以位翻为基础的重量攻击提供了一种通用的配方,根据攻击者的目的,可以对有效性术语进行定制。此外,我们提出了两种具有不同恶意目的的一般配方,即单一样本攻击和触发样品攻击。为此,我们将这一问题编成一种混合整形程序(MIP),以共同确定记忆中的二元点(0或1)的状态,并学习样本修改。我们利用最新的整形方案技术,将MIP问题改写成一种连续优化问题,用乘数交替方向方法(ADMMM)来有效解决。因此,在最优化战略中,反动的临界位数可以通过MISA来快速地测试,而不是使用他国空的MISA战略。