与反反反反反反事实学习有关的干预建议,以更好地了解用户首选项 (Interventional Recommendation with Contrastive Counterfactual Learning for Better Understanding User Preferences)

Recently, there has been a surging interest in formulating recommendations in the context of causal inference. The studies regard the recommendation as an intervention in causal inference and frame the users' preferences as interventional effects to improve recommender systems' generalization. Many studies in the field of causal inference for recommender systems have been focusing on utilizing propensity scores from the causal community that reduce the bias while inducing additional variance. Alternatively, some studies suggest the existence of a set of unbiased data from randomized controlled trials while it requires to satisfy certain assumptions that may be challenging in practice. In this paper, we first design a causal graph representing recommender systems' data generation and propagation process. Then, we reveal that the underlying exposure mechanism biases the maximum likelihood estimation (MLE) on observational feedback. In order to figure out users' preferences in terms of causality behind data, we leverage the back-door adjustment and do-calculus, which induces an interventional recommendation model (IREC). Furthermore, considering the confounder may be inaccessible for measurement, we propose a contrastive counterfactual learning method (CCL) for simulating the intervention. In addition, we present two extra novel sampling strategies and show an intriguing finding that sampling from counterfactual sets contributes to superior performance. We perform extensive experiments on two real-world datasets to evaluate and analyze the performance of our model IREC-CCL on unbiased test sets. Experimental results demonstrate our model outperforms the state-of-the-art methods.

翻译：最近,在因果推断的背景下,对拟订建议的兴趣日益浓厚。研究认为该建议是因果推断中的干预,将用户偏好框架视为改进建议系统一般化的干预效果。建议系统因果推断领域的许多研究一直侧重于利用因果社区的偏差分数,以减少偏差,同时引起更多的差异。或者,一些研究表明存在一套来自随机控制的试验的不偏倚的数据,同时需要满足在实践中可能具有挑战性的某些假设。在本文中,我们首先设计一个代表建议系统数据生成和传播过程的因果图表。然后,我们发现基本接触机制会影响观察反馈的最大可能性估计(MLE ) 。为了在数据背后的因果关系方面找出用户的偏好,我们利用了后门调整和度计算模型(IREC ) 。此外,考虑到测量可能无法利用同级分析器,我们提出了一种对比反向反向学习的方法(CCL ) 来模拟实际干预的结果。然后,我们发现基本的接触机制会影响观察反馈的最大可能性估计(MLE) 。为了从数据的因果关系方面找出用户偏差的测试方法,我们从实际的两套测试方法,我们从实际的实验性测试方法来展示。