There is a growing interest in developing unlearnable examples (UEs) against visual privacy leaks on the Internet. UEs are training samples added with invisible but unlearnable noise, which have been found can prevent unauthorized training of machine learning models. UEs typically are generated via a bilevel optimization framework with a surrogate model to remove (minimize) errors from the original samples, and then applied to protect the data against unknown target models. However, existing UE generation methods all rely on an ideal assumption called label-consistency, where the hackers and protectors are assumed to hold the same label for a given sample. In this work, we propose and promote a more practical label-agnostic setting, where the hackers may exploit the protected data quite differently from the protectors. E.g., a m-class unlearnable dataset held by the protector may be exploited by the hacker as a n-class dataset. Existing UE generation methods are rendered ineffective in this challenging setting. To tackle this challenge, we present a novel technique called Unlearnable Clusters (UCs) to generate label-agnostic unlearnable examples with cluster-wise perturbations. Furthermore, we propose to leverage VisionandLanguage Pre-trained Models (VLPMs) like CLIP as the surrogate model to improve the transferability of the crafted UCs to diverse domains. We empirically verify the effectiveness of our proposed approach under a variety of settings with different datasets, target models, and even commercial platforms Microsoft Azure and Baidu PaddlePaddle. Code is available at \url{https://github.com/jiamingzhang94/Unlearnable-Clusters}.
翻译:人们越来越有兴趣开发无法读取的示例(UES),防止互联网上出现视觉隐私泄漏。 UES是用无形但不可读的噪音来培训样本,这些样本可以防止未经授权的机器学习模型培训。 UES通常是通过双级优化框架生成的,其代用模型可以清除原始样本中的(最小化)错误,然后用于保护数据不受未知目标模型的影响。然而,现有的 UE 生成方法都依赖于一个称为标签- 透明性的理想假设,即黑客和保护者假定为某个特定样本持有相同的标签。在此工作中,我们提议并推广一个更实用的标签- 保密性设置,使黑客可以从保护者那里利用受保护的数据。E.g.,黑客可以将一个 m- 级的不可读性数据集用作正级的数据集。在这种挑战性环境下,现有的 UE 生成方法变得无效。为了应对这一挑战,我们提出了一个名为不可读的标签- 库/ 透明性模型(UCUC),我们提出一个名为不可读性分类的C- 的模型,可以生成一个不透明性标签- 透明性标签- LAreal- prevelildal 的C) 。</s>