Data poisoning attacks spoof a recommender system to make arbitrary, attacker-desired recommendations via injecting fake users with carefully crafted rating scores into the recommender system. We envision a cat-and-mouse game for such data poisoning attacks and their defenses, i.e., new defenses are designed to defend against existing attacks and new attacks are designed to break them. To prevent such a cat-and-mouse game, we propose PORE, the first framework to build provably robust recommender systems in this work. PORE can transform any existing recommender system to be provably robust against any untargeted data poisoning attacks, which aim to reduce the overall performance of a recommender system. Suppose PORE recommends top-$N$ items to a user when there is no attack. We prove that PORE still recommends at least $r$ of the $N$ items to the user under any data poisoning attack, where $r$ is a function of the number of fake users in the attack. Moreover, we design an efficient algorithm to compute $r$ for each user. We empirically evaluate PORE on popular benchmark datasets.
翻译:数据毒化攻击会通过向推荐系统中注入具有精心设计的评分得分的虚假用户,欺骗推荐系统,从而进行任意攻击者所需的推荐。我们设想这种数据毒化攻击和它们的防御是一种猫鼠游戏,即新的防御设计来抵御现有攻击,新的攻击设计用于打破它们。为了防止这种猫鼠游戏,我们提出了 POER 框架,该框架是首个能够在本文中构建出可证明强大的推荐系统的框架。PORE 可以将任何现有的推荐系统转换为对于任何非定向数据毒化攻击具有可证强大性,这些攻击旨在降低推荐系统的综合性能。假设 PORE 在没有攻击时向用户推荐前 $N$ 个项目,我们证明在任何数据毒化攻击下,PORE 仍会向用户推荐至少 $r$ 个项目,其中 $r$ 是攻击中虚假用户数量的函数。此外,我们设计了一种有效的算法来计算每个用户的$r$。我们在流行的基准数据集上进行了实证评估。