There is a growing interest in developing unlearnable examples (UEs) against visual privacy leaks on the Internet. UEs are training samples added with invisible but unlearnable noise, which have been found to prevent unauthorized training of machine learning models. UEs are typically generated via a bilevel optimization framework with a surrogate model to remove (minimize) errors from the original samples, and then applied to protect the data against unknown target models. However, existing UE generation methods all rely on an ideal assumption called label-consistency, where the hackers and protectors are assumed to hold the same label for a given sample. In this work, we propose and promote a more practical label-agnostic setting, where the hackers may exploit the protected data quite differently from the protectors. For example, an m-class unlearnable dataset held by the protector may be exploited by the hacker as an n-class dataset. Existing UE generation methods are rendered ineffective in this challenging setting. To tackle this challenge, we present a novel technique called Unlearnable Clusters (UCs) to generate label-agnostic unlearnable examples with cluster-wise perturbations. Furthermore, we propose to leverage Vision-and-Language Pre-trained Models (VLPMs) like CLIP as the surrogate model to improve the transferability of the crafted UCs to diverse domains. We empirically verify the effectiveness of our proposed approach under a variety of settings with different datasets, target models, and even commercial platforms such as Microsoft Azure and Baidu PaddlePaddle.
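To make the cluster-wise idea concrete, below is a minimal, hedged sketch of label-agnostic perturbation assignment: samples are clustered by features from a CLIP surrogate encoder, and each cluster receives one shared perturbation within an L-infinity budget. It assumes the OpenAI `clip` package and scikit-learn are available; the random per-cluster noise and the hyper-parameters (`k`, `eps`) are placeholders for illustration, not the optimized perturbations of the actual method.

```python
# Illustrative sketch only: cluster-wise, label-agnostic perturbation assignment.
# Assumes the OpenAI `clip` package and scikit-learn are installed.
import torch
import torch.nn.functional as F
import clip
from sklearn.cluster import KMeans

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/32", device=device)  # surrogate VLPM image encoder


def cluster_wise_perturb(images: torch.Tensor, k: int = 10, eps: float = 8 / 255):
    """Assign each image to a feature cluster and add that cluster's noise.

    images: batch of raw images, shape (N, 3, H, W), values in [0, 1].
    Returns perturbed images of the same shape.
    """
    with torch.no_grad():
        # CLIP expects 224x224 inputs; for this sketch we only resize and use
        # the resulting (rough) features purely for clustering.
        resized = F.interpolate(images, size=224, mode="bilinear", align_corners=False)
        feats = model.encode_image(resized.to(device)).float().cpu().numpy()

    # Group samples by surrogate features -- no labels are used (label-agnostic).
    labels = torch.from_numpy(KMeans(n_clusters=k, n_init=10).fit_predict(feats))

    # One shared perturbation per cluster, bounded by the L-infinity budget eps.
    # Here it is random noise as a stand-in for an optimized perturbation.
    cluster_noise = torch.empty(k, *images.shape[1:]).uniform_(-eps, eps)

    return (images + cluster_noise[labels]).clamp(0, 1)
```

In this sketch the only design choice carried over from the abstract is that perturbations are shared per cluster rather than per class, which is what removes the dependence on how the hacker later labels the data.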