Learned recommender systems may inadvertently leak information about their training data, leading to privacy violations. We investigate privacy threats faced by recommender systems through the lens of membership inference. In such attacks, an adversary aims to infer whether a user's data is used to train the target recommender. To achieve this, previous work has used a shadow recommender to derive training data for the attack model, and then predicts the membership by calculating difference vectors between users' historical interactions and recommended items. State-of-the-art methods face two challenging problems: (1) training data for the attack model is biased due to the gap between shadow and target recommenders, and (2) hidden states in recommenders are not observational, resulting in inaccurate estimations of difference vectors. To address the above limitations, we propose a Debiasing Learning for Membership Inference Attacks against recommender systems (DL-MIA) framework that has four main components: (1) a difference vector generator, (2) a disentangled encoder, (3) a weight estimator, and (4) an attack model. To mitigate the gap between recommenders, a variational auto-encoder (VAE) based disentangled encoder is devised to identify recommender invariant and specific features. To reduce the estimation bias, we design a weight estimator, assigning a truth-level score for each difference vector to indicate estimation accuracy. We evaluate DL-MIA against both general recommenders and sequential recommenders on three real-world datasets. Experimental results show that DL-MIA effectively alleviates training and estimation biases simultaneously, and achieves state-of-the-art attack performance.
翻译:教益系统可能无意中泄漏有关其培训数据的信息,从而导致侵犯隐私。我们通过会员推算的镜头调查建议系统面临的隐私威胁。在这类袭击中,对手的目的是推断用户的数据是否用于培训目标建议者。为此,以前的工作使用了影子建议器来为攻击模式获取培训数据,然后通过计算用户历史互动与推荐项目之间的矢量差异来预测会籍。最先进的方法面临两个具有挑战性的问题:(1)攻击模式的培训数据由于影子与目标建议者之间的差距而偏向于攻击模式的培训数据,以及(2)推荐者中隐藏的状态不是观察性的,导致对差异矢量的估算不准确。为了应对上述局限性,我们建议采用“对成员进行偏差学习”的方法,为攻击模式(DL-MIA)框架,通过计算用户历史互动与推荐项目之间的矢量差异,然后预测成员构成四个主要组成部分:(1) 矢量生成器差异的矢量评估器,(2) 分解的测算器,(3) 重量测度测算器,以及(4) 攻击模式。为了缩小推荐者之间的差距,每个变量水平、自动测算值的自动测算结果, 和我们测算的估测度显示一个特定的计算结果。