Existing integrity verification approaches for deep models are designed for private verification (i.e., assuming the service provider is honest, with white-box access to model parameters). However, private verification approaches do not allow model users to verify the model at run-time. Instead, they must trust the service provider, who may tamper with the verification results. In contrast, a public verification approach that considers the possibility of dishonest service providers can benefit a wider range of users. In this paper, we propose PublicCheck, a practical public integrity verification solution for services of run-time deep models. PublicCheck considers dishonest service providers, and overcomes public verification challenges of being lightweight, providing anti-counterfeiting protection, and having fingerprinting samples that appear smooth. To capture and fingerprint the inherent prediction behaviors of a run-time model, PublicCheck generates smoothly transformed and augmented encysted samples that are enclosed around the model's decision boundary while ensuring that the verification queries are indistinguishable from normal queries. PublicCheck is also applicable when knowledge of the target model is limited (e.g., with no knowledge of gradients or model parameters). A thorough evaluation of PublicCheck demonstrates the strong capability for model integrity breach detection (100% detection accuracy with less than 10 black-box API queries) against various model integrity attacks and model compression attacks. PublicCheck also demonstrates the smooth appearance, feasibility, and efficiency of generating a plethora of encysted samples for fingerprinting.
翻译:为深层模型设计了现有廉正核查办法,用于私人核查(即假设服务提供商诚实,使用白箱访问模型参数);然而,私人核查办法不允许示范用户在运行时对模型进行核实;相反,他们必须信任服务提供者,因为供应商可能会篡改核查结果;相反,一种公共核查办法,即考虑不诚实服务提供者的可能性,可以使更广泛的用户受益;在本文件中,我们建议公众检查,一种实用的公共廉正核查办法,用于运行时间深度模型的服务;公众检查,考虑不诚实的服务提供者,克服公共核查挑战,即其体重轻,提供反伪造保护,以及指纹样本看起来光滑;要捕捉和鉴别运行时模型的固有预测行为,公众检查产生顺利变化,并增加嵌入于模型决定界限的精度样本;同时确保核查询问与正常查询不相干;当目标模型知识有限时,公众检查也适用公开核查(例如,对梯度或模型参数的了解,没有提供反反伪造保护,并且指纹样本样本样本样本显示样本样本样本样本样本样本样本样本;对10级的准确性检查,还展示了一种强的准确性检查能力,以检测10度攻击的准确性检查。