取消学习 (Recommendation Unlearning)

Recommender systems provide essential web services by learning users' personal preferences from collected data. However, in many cases, systems also need to forget some training data. From the perspective of privacy, several privacy regulations have recently been proposed, requiring systems to eliminate any impact of the data whose owner requests to forget. From the perspective of utility, if a system's utility is damaged by some bad data, the system needs to forget these data to regain utility. From the perspective of usability, users can delete noise and incorrect entries so that a system can provide more useful recommendations. While unlearning is very important, it has not been well-considered in existing recommender systems. Although there are some researches have studied the problem of machine unlearning in the domains of image and text data, existing methods can not been directly applied to recommendation as they are unable to consider the collaborative information. In this paper, we propose RecEraser, a general and efficient machine unlearning framework tailored to recommendation task. The main idea of RecEraser is to partition the training set into multiple shards and train a constituent model for each shard. Specifically, to keep the collaborative information of the data, we first design three novel data partition algorithms to divide training data into balanced groups based on their similarity. Then, considering that different shard models do not uniformly contribute to the final prediction, we further propose an adaptive aggregation method to improve the global model utility. Experimental results on three public benchmarks show that RecEraser can not only achieve efficient unlearning, but also outperform the state-of-the-art unlearning methods in terms of model utility. The source code can be found at https://github.com/chenchongthu/Recommendation-Unlearning

翻译：推荐者系统通过从收集的数据中学习用户的个人偏好来提供基本的网络服务。但是, 在许多情况下, 系统也需要忘记一些培训数据。从隐私的角度来看, 最近提出了几项隐私条例, 要求系统消除所有者要求忘记的数据的任何影响。从实用的角度来看, 如果一个系统的效用因一些坏数据而受损, 系统需要忘记这些数据才能重新获得效用。从可用性角度看, 用户可以删除噪音和不正确的条目, 以便一个系统能够提供更有用的建议。虽然不学习非常重要, 但现有的推荐基准系统并没有很好考虑。尽管有些研究最近提出了若干隐私条例, 要求消除数据所有人要求忘记的数据的任何影响。从实用的角度来看, 如果系统效用受到某些坏数据损坏, 系统需要忘记这些数据。从可用性的角度看, 系统用户可以删除噪音和不正确的条目, 这样系统可以提供更有用的建议。尽管不学习非常重要, 但现有的推荐基准系统并没有很好地考虑过。尽管有些研究已经做了一些研究, 但是有些研究已经研究, 在图像和文本领域, 现有的方法无法直接应用。我们设计三个新的数据分析方法。

相关内容

MoDELS

关注 0

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

29+阅读 · 2022年3月5日

【Max Welling】图神经网络知识表示与推荐，Graph Neural Networks for Knowledge Representation and Recommendation

专知会员服务

41+阅读 · 2022年3月4日

专知会员服务

38+阅读 · 2020年11月3日

【KDD2020-Tutorial】自动推荐系统，Automated Recommendation System

专知会员服务

51+阅读 · 2020年8月25日