Backdoor-based watermarking schemes were proposed to protect the intellectual property of artificial intelligence models, especially deep neural networks, under the black-box setting. Compared with ordinary backdoors, backdoor-based watermarks need to digitally incorporate the owner's identity, which fact adds extra requirements to the trigger generation and verification programs. Moreover, these concerns produce additional security risks after the watermarking scheme has been published for as a forensics tool or the owner's evidence has been eavesdropped on. This paper proposes the capsulation attack, an efficient method that can invalidate most established backdoor-based watermarking schemes without sacrificing the pirated model's functionality. By encapsulating the deep neural network with a rule-based or Bayes filter, an adversary can block ownership probing and reject the ownership verification. We propose a metric, CAScore, to measure a backdoor-based watermarking scheme's security against the capsulation attack. This paper also proposes a new backdoor-based deep neural network watermarking scheme that is secure against the capsulation attack by reversing the encoding process and randomizing the exposure of triggers.
翻译:与普通的后门或Bayes过滤器相比,后门的水标记需要数字化地纳入所有人的身份,这一事实增加了触发生成和核查程序的额外要求。此外,这些关切在作为法证工具公布水标记办法或业主的证据被偷盗后产生额外的安全风险。本文还提议了封套攻击,这是一种有效的方法,可以使大多数已确立的后门水标记办法无效,而不必牺牲盗版模式的功能。通过将深层神经网络包装成基于规则或Bayes过滤器,敌人可以阻止所有权检验和拒绝所有权核查。我们提议了一种衡量标准,即CAScore,以衡量以后门为主的水标记办法对顶部攻击的安全性。本文还提议了一种新的基于后门的深层神经网水标记办法,通过扭转编码过程和随机地设定触发触发的触发器,来防范封套攻击。