In model extraction attacks, adversaries can steal a machine learning model exposed via a public API by repeatedly querying it and adjusting their own model based on the obtained predictions. To prevent model stealing, existing defenses focus on detecting malicious queries, or on truncating or distorting outputs, thus necessarily introducing a tradeoff between robustness and model utility for legitimate users. Instead, we propose to impede model extraction by requiring users to complete a proof-of-work before they can read the model's predictions. This deters attackers by greatly increasing (even up to 100x) the computational effort needed to leverage query access for model extraction. Since we calibrate the effort required to complete the proof-of-work to each query, this introduces only a slight overhead for regular users (up to 2x). To achieve this, our calibration applies tools from differential privacy to measure the information revealed by a query. Our method requires no modification of the victim model and can be applied by machine learning practitioners to guard their publicly exposed models against being easily stolen.
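Below is a minimal sketch of the gating idea, assuming a HashCash-style SHA-256 puzzle whose difficulty is scaled by an estimated per-query information cost. The function names, the linear difficulty mapping, and the stubbed privacy-cost value are illustrative assumptions, not the paper's exact calibration, which relies on differential-privacy-style accounting of the information revealed by each query.

```python
import hashlib
import secrets


def pow_difficulty(privacy_cost: float, base_bits: int = 8, scale: float = 4.0) -> int:
    """Map an estimated per-query information (privacy) cost to a number of
    leading zero bits the client must find. This linear mapping is a
    placeholder: the goal is that typical benign queries stay cheap while
    high-information (extraction-like) queries become expensive."""
    return base_bits + int(scale * privacy_cost)


def issue_challenge() -> bytes:
    """Server side: fresh random challenge bound to one prediction request."""
    return secrets.token_bytes(16)


def solve(challenge: bytes, bits: int) -> int:
    """Client side: HashCash-style search for a nonce whose hash has
    `bits` leading zero bits. Expected work grows as 2**bits."""
    target = 1 << (256 - bits)
    nonce = 0
    while True:
        digest = hashlib.sha256(challenge + nonce.to_bytes(8, "big")).digest()
        if int.from_bytes(digest, "big") < target:
            return nonce
        nonce += 1


def verify(challenge: bytes, nonce: int, bits: int) -> bool:
    """Server side: checking a solution costs a single hash."""
    digest = hashlib.sha256(challenge + nonce.to_bytes(8, "big")).digest()
    return int.from_bytes(digest, "big") < (1 << (256 - bits))


# Example flow: the server estimates the information revealed by the query
# (stubbed here), issues a puzzle of matching difficulty, and only returns
# the model's prediction once the client presents a valid nonce.
estimated_cost = 1.5                    # placeholder for the per-query cost estimate
bits = pow_difficulty(estimated_cost)
challenge = issue_challenge()
nonce = solve(challenge, bits)          # done by the querying user
assert verify(challenge, nonce, bits)   # done by the model owner before replying
```

The key property is the asymmetry of the puzzle: solving costs the client on the order of 2**bits hash evaluations, while verification is a single hash, so the server-side overhead of gating each prediction stays negligible.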