How to Steer Your Adversary: Targeted and Efficient Model Stealing Defenses with Gradient Redirection

Model stealing attacks present a dilemma for public machine learning APIs. To protect financial investments, companies may be forced to withhold important information about their models that could facilitate theft, including uncertainty estimates and prediction explanations. This compromise is harmful not only to users but also to external transparency. Model stealing defenses seek to resolve this dilemma by making models harder to steal while preserving utility for benign users. However, existing defenses have poor performance in practice, either requiring enormous computational overheads or severe utility trade-offs. To meet these challenges, we present a new approach to model stealing defenses called gradient redirection. At the core of our approach is a provably optimal, efficient algorithm for steering an adversary's training updates in a targeted manner. Combined with improvements to surrogate networks and a novel coordinated defense strategy, our gradient redirection defense, called GRAD${}^2$, achieves small utility trade-offs and low computational overhead, outperforming the best prior defenses. Moreover, we demonstrate how gradient redirection enables reprogramming the adversary with arbitrary behavior, which we hope will foster work on new avenues of defense.

