Recently, the enactment of privacy regulations has promoted the rise of the machine unlearning paradigm. Existing studies of machine unlearning mainly focus on sample-wise unlearning, so that a learnt model does not expose users' privacy at the sample level. Yet we argue that such selective removal should also be available at the attribute level, especially for attributes irrelevant to the main task, e.g., whether a person recognized by a face recognition system wears glasses, or that person's age range. Through a comprehensive literature review, we find that existing studies on attribute-related problems such as fairness and de-biasing cannot properly address the above concerns. To bridge this gap, we propose a paradigm of selectively removing input attributes from feature representations, which we name `attribute unlearning'. In this paradigm, certain attributes are accurately captured and detached from the learned feature representations during training, according to their mutual information. The targeted attributes are progressively eliminated as training proceeds towards convergence, while the remaining attributes related to the main task are preserved to achieve competitive model performance. Considering the computational complexity of the training process, we not only provide a theoretically grounded approximate training method, but also propose an acceleration scheme to speed up training. We validate our method across several datasets and models, and demonstrate that our design preserves model fidelity and achieves strong unlearning efficacy with high efficiency. The proposed unlearning paradigm lays a foundation for future machine unlearning systems and can become an essential component in complying with the latest privacy-related legislation.
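To make the core idea concrete, the sketch below illustrates one plausible instantiation of attribute unlearning during training: the mutual information between the learned features and a target attribute is approximated by an auxiliary attribute predictor, and the encoder is penalized whenever that attribute remains predictable, while the main-task loss preserves fidelity. This is a minimal illustrative sketch, not the authors' implementation; all module names, the adversarial entropy-based MI proxy, and the weight `lam` are assumptions.

```python
# Minimal sketch (illustrative assumption, not the paper's code): attribute
# unlearning via an adversarial attribute predictor used as a proxy for the
# mutual information I(features; attribute).
import torch
import torch.nn as nn
import torch.nn.functional as F

class Encoder(nn.Module):
    """Backbone mapping inputs to feature representations."""
    def __init__(self, in_dim=784, feat_dim=128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 256), nn.ReLU(),
                                 nn.Linear(256, feat_dim))
    def forward(self, x):
        return self.net(x)

class Head(nn.Module):
    """Linear head, used for both the main task and the attribute."""
    def __init__(self, feat_dim=128, n_classes=10):
        super().__init__()
        self.fc = nn.Linear(feat_dim, n_classes)
    def forward(self, z):
        return self.fc(z)

def training_step(encoder, task_head, attr_head, x, y_task, y_attr,
                  opt_model, opt_attr, lam=1.0):
    # 1) Fit the attribute predictor on detached features; its loss acts as
    #    a proxy for how much attribute information the features still carry.
    z = encoder(x).detach()
    attr_loss = F.cross_entropy(attr_head(z), y_attr)
    opt_attr.zero_grad(); attr_loss.backward(); opt_attr.step()

    # 2) Update encoder + task head: keep main-task accuracy while pushing the
    #    attribute predictions toward maximum entropy (attribute unpredictable).
    z = encoder(x)
    task_loss = F.cross_entropy(task_head(z), y_task)
    attr_probs = F.softmax(attr_head(z), dim=1)
    neg_entropy = (attr_probs * attr_probs.clamp_min(1e-8).log()).sum(dim=1).mean()
    loss = task_loss + lam * neg_entropy  # lam trades fidelity vs. unlearning
    opt_model.zero_grad(); loss.backward(); opt_model.step()
    return task_loss.item(), attr_loss.item()
```

Under this sketch, repeating the step over the training set progressively removes the chosen attribute from the representations while the main-task head retains competitive performance; the acceleration scheme mentioned in the abstract is not reflected here.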