Natural language processing (NLP) technology has shown great commercial value in applications such as sentiment analysis. But NLP models are vulnerable to the threat of pirated redistribution, damaging the economic interests of model owners. Digital watermarking technology is an effective means to protect the intellectual property rights of NLP model. The existing NLP model protection mainly designs watermarking schemes by improving both security and robustness purposes, however, the security and robustness of these schemes have the following problems, respectively: (1) Watermarks are difficult to defend against fraudulent declaration by adversary and are easily detected and blocked from verification by human or anomaly detector during the verification process. (2) The watermarking model cannot meet multiple robustness requirements at the same time. To solve the above problems, this paper proposes a novel watermarking framework for NLP model based on the over-parameterization of depth model and the multi-task learning theory. Specifically, a covert trigger set is established to realize the perception-free verification of the watermarking model, and a novel auxiliary network is designed to improve the robustness and security of the watermarking model. The proposed framework was evaluated on two benchmark datasets and three mainstream NLP models, and the results show that the framework can successfully validate model ownership with 100% validation accuracy and advanced robustness and security without compromising the host model performance.
翻译:自然语言处理(NLP)技术在情绪分析等应用中显示出巨大的商业价值。但是,NLP模型很容易受到盗版再分配的威胁,破坏模型拥有者的经济利益。数字水标记技术是保护NLP模型知识产权的有效手段。现有的NLP模型保护主要通过改进安全和稳健性目的设计水标记计划,然而,这些计划的安全和稳健性分别存在下列问题:(1) 水标记难以抵御对手的欺诈性申报,在核查过程中很容易被人类或异常探测器探测到并阻止核查。 (2) 水标记模型无法同时满足多重稳健性要求。为解决上述问题,本文件提议了一个新的NLP模型水标记框架,其基础是深度模型和多功能学习理论,但具体地说,为了实现对水标记模型的无感知性核查,建立了一套隐蔽触发装置,并设计了一个新型辅助网络,以改善水标记模型的稳健性和安全性。拟议的基准框架以两种基准化模型和高级安全性模型为基础,可以成功地评估稳妥性模型和升级性模型。