消除建议系统中的民众偏见的示范不可允许的反事实理由 (Model-Agnostic Counterfactual Reasoning for Eliminating Popularity Bias in Recommender System)

The general aim of the recommender system is to provide personalized suggestions to users, which is opposed to suggesting popular items. However, the normal training paradigm, i.e., fitting a recommender model to recover the user behavior data with pointwise or pairwise loss, makes the model biased towards popular items. This results in the terrible Matthew effect, making popular items be more frequently recommended and become even more popular. Existing work addresses this issue with Inverse Propensity Weighting (IPW), which decreases the impact of popular items on the training and increases the impact of long-tail items. Although theoretically sound, IPW methods are highly sensitive to the weighting strategy, which is notoriously difficult to tune. In this work, we explore the popularity bias issue from a novel and fundamental perspective -- cause-effect. We identify that popularity bias lies in the direct effect from the item node to the ranking score, such that an item's intrinsic property is the cause of mistakenly assigning it a higher ranking score. To eliminate popularity bias, it is essential to answer the counterfactual question that what the ranking score would be if the model only uses item property. To this end, we formulate a causal graph to describe the important cause-effect relations in the recommendation process. During training, we perform multi-task learning to achieve the contribution of each cause; during testing, we perform counterfactual inference to remove the effect of item popularity. Remarkably, our solution amends the learning process of recommendation which is agnostic to a wide range of models. We demonstrate it on Matrix Factorization (MF) and LightGCN, which are representative of the conventional and state-of-the-art model for collaborative filtering. Experiments on five real-world datasets demonstrate the effectiveness of our method.

翻译：推荐人系统的总目标是向用户提供个性化建议,这与推荐受欢迎的项目相反。但是,正常的培训模式,即安装一个推荐人模型,用点或对针丢失来恢复用户行为数据,使得模型偏向流行项目。这导致了可怕的马修效应,使流行项目更经常地被推荐,甚至更受欢迎。现有工作用反偏重(IPW)解决这个问题,降低了受欢迎项目对培训的影响,增加了长尾项目的影响。尽管理论上是健全的,但IPW方法对加权战略非常敏感,而这种战略很难调和。在这项工作中,我们从新颖和基本的角度探讨受欢迎偏重问题 -- -- 原因效应。我们发现流行偏重在于项目节到排名分的直接效果,因此,项目的内在属性是错误地分。为了消除受欢迎偏重的偏重性偏差,对于反切题来说,必须回答,如果模型只使用项的偏重重度战略,那么IPW方法就会非常敏感,而这种战略很难调调调。在这个工作中,我们从新的和基本的角度来探讨受欢迎的偏重的偏重偏重偏重偏差偏见问题。我们学习了每个研究过程,我们学习了一次的排序,我们学习了一次的排序,我们学习过程的顺序,我们学习了。我们开始一个结果,我们学习了对结果的偏重重重的顺序,我们学习了。我们学习了。我们学习了。