We introduce LLA, an effective intellectual property (IP) protection scheme for generative AI models. LLA leverages the synergy between hardware and software to defend against various supply chain threats, including model theft, model corruption, and information leakage. On the software side, it embeds key bits into neurons that can trigger outliers to degrade performance and applies invariance transformations to obscure the key values. On the hardware side, it integrates a lightweight locking module into the AI accelerator while maintaining compatibility with various dataflow patterns and toolchains. An accelerator with a pre-stored secret key acts as a license to access the model services provided by the IP owner. The evaluation results show that LLA can withstand a broad range of oracle-guided key optimization attacks, while incurring a minimal computational overhead of less than 0.1% for 7,168 key bits.
翻译:本文提出LLA,一种针对生成式人工智能模型的有效知识产权保护方案。LLA利用硬件与软件的协同作用,抵御包括模型窃取、模型篡改和信息泄露在内的多种供应链威胁。在软件层面,该方法将密钥位嵌入神经元中,可触发异常值以降低模型性能,并应用不变性变换以隐藏密钥值。在硬件层面,它将轻量级锁定模块集成至AI加速器中,同时保持与多种数据流模式和工具链的兼容性。搭载预存储密钥的加速器可作为访问知识产权所有者所提供模型服务的许可凭证。评估结果表明,LLA能够抵御广泛的预言机引导密钥优化攻击,同时在嵌入7,168个密钥位的情况下仅产生低于0.1%的计算开销。