In the mobile Internet era, recommender systems have become an irreplaceable tool to help users discover useful items, thus alleviating the information overload problem. Recent research on deep neural network (DNN)-based recommender systems have made significant progress in improving prediction accuracy, largely attributed to the widely accessible large-scale user data. Such data is commonly collected from users' personal devices, and then centrally stored in the cloud server to facilitate model training. However, with the rising public concerns on user privacy leakage in online platforms, online users are becoming increasingly anxious over abuses of user privacy. Therefore, it is urgent and beneficial to develop a recommender system that can achieve both high prediction accuracy and strong privacy protection. To this end, we propose a DNN-based recommendation model called PrivRec running on the decentralized federated learning (FL) environment, which ensures that a user's data is fully retained on her/his personal device while contributing to training an accurate model. On the other hand, to better embrace the data heterogeneity (e.g., users' data vary in scale and quality significantly) in FL, we innovatively introduce a first-order meta-learning method that enables fast on-device personalization with only a few data points. Furthermore, to defend against potential malicious participants that pose serious security threat to other users, we further develop a user-level differentially private model, namely DP-PrivRec, so attackers are unable to identify any arbitrary user from the trained model. Finally, we conduct extensive experiments on two large-scale datasets in a simulated FL environment, and the results validate the superiority of both PrivRec and DP-PrivRec.
翻译:在移动互联网时代,推荐人系统已成为帮助用户发现有用项目的一个不可替代的工具,从而缓解信息超载问题。最近对深神经网络(DNN)推荐人系统的研究在提高预测准确性方面取得了显著进展,这主要归功于可广泛获取的大规模用户数据。这些数据通常从用户个人设备中收集,然后集中储存在云端服务器中,以便利模式培训。然而,随着公众对在线平台用户隐私渗漏的公众关注日益增加,在线用户对滥用用户隐私问题越来越感到焦虑。因此,开发一个既能达到高预测准确度又能提供强的隐私保护的推荐人系统是紧迫和有益的。为此,我们提出了一个基于DNNE的建议模式,称为PrivRec, 其建议模型在分散化的联邦学习(FL)环境中运行,确保用户数据完全保留在她/他的个人设备上,同时帮助培训一个准确的模型。另一方面,为了更好地接受模型的偏差性(例如,用户在规模和质量上的差异数据差异很大。我们创新地引入了一种第一级任意的元化实验方法, 使潜在用户能够快速地在安全上发展一种潜在的系统。