Copyright protection for deep neural networks (DNNs) is an urgent need for AI corporations. DNN watermarking is an emerging technique for tracing illegally distributed model copies by embedding and verifying secret identity messages in a model's prediction behavior or its internals. Because it sacrifices less model functionality and exploits more knowledge of the target DNN, the latter branch, called \textit{white-box DNN watermarking}, is believed to be accurate, credible, and secure against most known watermark removal attacks, and is attracting increasing research effort in both academia and industry. In this paper, we present the first systematic study of how mainstream white-box DNN watermarks are commonly vulnerable to neural structural obfuscation with \textit{dummy neurons}, i.e., neurons that can be added to a target model while leaving its behavior invariant. By devising a comprehensive framework that automatically generates and injects dummy neurons with high stealthiness, our novel attack intensively modifies the architecture of the target model and thereby inhibits the success of watermark verification. Through extensive evaluation, our work shows for the first time that nine published watermarking schemes require amendments to their verification procedures.
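To make the dummy-neuron idea concrete, the following is a minimal PyTorch sketch, not the paper's actual generation framework: it appends one extra hidden unit to a two-layer MLP and zeroes its outgoing weights, so the augmented model computes exactly the same function even though its layer shapes, and hence the parameter positions a white-box verifier would inspect, have changed.

\begin{verbatim}
# Illustrative sketch (assumed construction, not the paper's framework):
# inject one dummy neuron into the hidden layer of a two-layer MLP by adding
# a row of arbitrary incoming weights to the first layer and a zero column
# to the second layer, so the extra neuron never affects the output.
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
x = torch.randn(16, 4)
y_before = model(x)

with torch.no_grad():
    fc1, fc2 = model[0], model[2]
    new_fc1 = nn.Linear(4, 9)           # 8 original units + 1 dummy unit
    new_fc1.weight[:8] = fc1.weight     # copy original incoming weights
    new_fc1.bias[:8] = fc1.bias         # dummy unit keeps its random init
    new_fc2 = nn.Linear(9, 2)
    new_fc2.weight[:, :8] = fc2.weight  # copy original outgoing weights
    new_fc2.weight[:, 8:] = 0.0         # dummy output is multiplied by zero
    new_fc2.bias.copy_(fc2.bias)

obfuscated = nn.Sequential(new_fc1, nn.ReLU(), new_fc2)
y_after = obfuscated(x)
# Different architecture (9 vs. 8 hidden units), identical input-output map.
print(torch.allclose(y_before, y_after, atol=1e-6))  # expected: True
\end{verbatim}

Under this simplified view, a verifier that reads watermark information from fixed parameter positions or neuron statistics no longer finds it where expected, even though the stolen model's functionality is fully preserved.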