Machine unlearning refers to mechanisms that can remove the influence of a subset of training data upon request from a trained model without incurring the cost of re-training from scratch. This paper develops a unified PAC-Bayesian framework for machine unlearning that recovers the two recent design principles - variational unlearning (Nguyen et.al., 2020) and forgetting Lagrangian (Golatkar et.al., 2020) - as information risk minimization problems (Zhang,2006). Accordingly, both criteria can be interpreted as PAC-Bayesian upper bounds on the test loss of the unlearned model that take the form of free energy metrics.
翻译:机器不学习是指可以应请求从经过培训的模型中消除一组培训数据的影响而无需从零开始再培训费用的机制,本文件为机器不学习制定了统一的PAC-Bayesian框架,以恢复最近的两项设计原则----变式不学习(Nguyen等人,2020年)和忘记Lagrangian(Golatkar等人,2020年)----作为信息风险最小化问题(Zhang 2006),因此,这两项标准可被解释为PAC-Bayesian关于以免费能源衡量为形式的未学习模式测试损失的上限。