Observed events in recommendation are consequence of the decisions made by a policy, thus they are usually selectively labeled, namely the data are Missing Not At Random (MNAR), which often causes large bias to the estimate of true outcomes risk. A general approach to correct MNAR bias is performing small Randomized Controlled Trials (RCTs), where an additional uniform policy is employed to randomly assign items to each user. In this work, we concentrate on the fairness of RCTs under both homogeneous and heterogeneous demographics, especially analyzing the bias for the least favorable group on the latter setting. Considering RCTs' limitations, we propose a novel Counterfactual Robust Risk Minimization (CRRM) framework, which is totally free of expensive RCTs, and derive its theoretical generalization error bound. At last, empirical experiments are performed on synthetic tasks and real-world data sets, substantiating our method's superiority both in fairness and generalization.
翻译:建议中观察到的事件是政策决定的结果,因此通常被选择性地标为数据失踪不是随机数据(MNAR),这往往对真实结果风险的估计产生很大的偏差。纠正 MNAR偏差的一般做法是小型随机控制试验,采用新的统一政策随机地向每个用户分配项目。在这项工作中,我们集中关注在单一和不同人口结构下RCT的公平性,特别是分析在后一种情况下对最不利群体的偏差。考虑到RCT的局限性,我们提出一个新的反事实强风险最小化框架(CRRM),完全没有昂贵的RCT,并得出其理论上的笼统错误。最后,在合成任务和真实世界数据集上进行了实验,证明了我们的方法在公平和概括方面优势。