Deep neural networks (DNNs) have demonstrated their superiority in practice. Arguably, the rapid development of DNNs has benefited largely from high-quality (open-source) datasets, with which researchers and developers can easily evaluate and improve their learning methods. Since data collection is usually time-consuming, or even expensive, how to protect dataset copyrights is of great significance and worth further exploration. In this paper, we revisit dataset ownership verification. We find that existing verification methods introduce new security risks into DNNs trained on the protected dataset, due to the targeted nature of poison-only backdoor watermarks. To alleviate this problem, we explore an untargeted backdoor watermarking scheme, in which the abnormal model behaviors are not deterministic. Specifically, we introduce two dispersibilities and prove their correlation, based on which we design untargeted backdoor watermarks under both poisoned-label and clean-label settings. We also discuss how to use the proposed untargeted backdoor watermark for dataset ownership verification. Experiments on benchmark datasets verify the effectiveness of our methods and their resistance to existing backdoor defenses. Our code is available at \url{https://github.com/THUYimingLi/Untargeted_Backdoor_Watermark}.
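To make the untargeted idea concrete, the following is a minimal sketch (in Python/NumPy, with a hypothetical helper name `untargeted_watermark` and illustrative default parameters) of how a poisoned-label untargeted watermark could be embedded: a trigger patch is stamped onto a small fraction of training images, and each poisoned sample is relabeled with a random incorrect class rather than a fixed target, so the induced misbehavior is dispersed over classes instead of deterministic. This is an assumption-laden illustration, not the authors' implementation; the actual method is available in the linked repository.

```python
import numpy as np

def untargeted_watermark(images, labels, num_classes,
                         poison_rate=0.1, patch_size=3, patch_value=1.0,
                         seed=0):
    """Sketch of a poisoned-label untargeted backdoor watermark.

    images: (N, H, W, C) float array in [0, 1]; labels: (N,) int array.
    Returns watermarked copies of both (hypothetical helper, not the
    authors' code).
    """
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    n_poison = int(len(images) * poison_rate)
    idx = rng.choice(len(images), size=n_poison, replace=False)
    for i in idx:
        # Stamp a solid square trigger in the bottom-right corner.
        images[i, -patch_size:, -patch_size:, :] = patch_value
        # Draw a random label different from the ground truth, so the
        # watermark has no single target class (untargeted behavior).
        wrong = rng.integers(num_classes - 1)
        labels[i] = wrong if wrong < labels[i] else wrong + 1
    return images, labels
```

A dataset watermarked this way can later support ownership verification: a suspicious model trained on the protected data should produce dispersed, low-accuracy predictions on trigger-stamped samples while behaving normally on benign ones, and this gap can be tested statistically.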