The intellectual property (IP) of Deep neural networks (DNNs) can be easily ``stolen'' by surrogate model attack. There has been significant progress in solutions to protect the IP of DNN models in classification tasks. However, little attention has been devoted to the protection of DNNs in image processing tasks. By utilizing consistent invisible spatial watermarks, one recent work first considered model watermarking for deep image processing networks and demonstrated its efficacy in many downstream tasks. Nevertheless, it highly depends on the hypothesis that the embedded watermarks in the network outputs are consistent. When the attacker uses some common data augmentation attacks (e.g., rotate, crop, and resize) during surrogate model training, it will totally fail because the underlying watermark consistency is destroyed. To mitigate this issue, we propose a new watermarking methodology, namely ``structure consistency'', based on which a new deep structure-aligned model watermarking algorithm is designed. Specifically, the embedded watermarks are designed to be aligned with physically consistent image structures, such as edges or semantic regions. Experiments demonstrate that our method is much more robust than the baseline method in resisting data augmentation attacks for model IP protection. Besides that, we further test the generalization ability and robustness of our method to a broader range of circumvention attacks.
翻译:深海神经网络(DNN)的知识产权(IP) 很容易被替代模型攻击而“ 吞没 ” 。 在分类任务中,在保护 DNN 模型的IP 方面取得了显著进展。 但是,在图像处理任务中,对保护 DNN 模型的解决方案没有多少注意。 通过使用一致的隐性空间水标记,最近的一项工作首先考虑为深图像处理网络设计示范水标记,并在许多下游任务中展示其效力。然而,这在很大程度上取决于以下假设:网络产出中的嵌入水标记是一致的。当攻击者在代理模型培训中使用一些共同的数据增强攻击(例如旋转、作物和调整大小)时,它将完全失败,因为基本的水标记一致性被破坏。为了缓解这一问题,我们提出了一种新的水标记方法,即“结构一致性”,在此基础上设计了一个新的深结构一致的模型水标记算法。具体地说,嵌入水标记的设计是要与实际一致的图像结构相一致,例如边缘或语义区域。实验显示,在代理模型区域中,我们使用某种更稳健健的模型攻击能力比我们较强的数据测试系统更坚固的增强攻击能力,比更坚固的基线方法。