学习进推反事实:估算多重平衡待遇效果的元学习 (Learning to Infer Counterfactuals: Meta-Learning for Estimating Multiple Imbalanced Treatment Effects)

We regularly consider answering counterfactual questions in practice, such as "Would people with diabetes take a turn for the better had they choose another medication?". Observational studies are growing in significance in answering such questions due to their widespread accumulation and comparatively easier acquisition than Randomized Control Trials (RCTs). Recently, some works have introduced representation learning and domain adaptation into counterfactual inference. However, most current works focus on the setting of binary treatments. None of them considers that different treatments' sample sizes are imbalanced, especially data examples in some treatment groups are relatively limited due to inherent user preference. In this paper, we design a new algorithmic framework for counterfactual inference, which brings an idea from Meta-learning for Estimating Individual Treatment Effects (MetaITE) to fill the above research gaps, especially considering multiple imbalanced treatments. Specifically, we regard data episodes among treatment groups in counterfactual inference as meta-learning tasks. We train a meta-learner from a set of source treatment groups with sufficient samples and update the model by gradient descent with limited samples in target treatment. Moreover, we introduce two complementary losses. One is the supervised loss on multiple source treatments. The other loss which aligns latent distributions among various treatment groups is proposed to reduce the discrepancy. We perform experiments on two real-world datasets to evaluate inference accuracy and generalization ability. Experimental results demonstrate that the model MetaITE matches/outperforms state-of-the-art methods.

翻译：我们经常考虑在实践中回答反事实问题,比如“糖尿病患者如果选择另一种药物是否会转好点呢?”。观察研究在回答这些问题方面的重要性越来越大,因为其积累广泛,而且比随机化控制试验(RCTs)更容易获得。最近,一些作品将代表性学习和领域适应引入反事实推论。然而,大多数当前工作的重点是确定二进制治疗。没有人认为不同治疗的样本大小不平衡,特别是某些治疗组中的数据例子由于用户的固有偏好而相对有限。在本文件中,我们设计了一个反事实推论的新算法框架,从估算个人治疗效果(MetateITE)的元学习中引出一个想法,以填补上述研究差距,特别是考虑到多重不平衡的治疗。我们把反事实推论治疗组中的数据事件视为元学习任务。我们从一组源治疗组中培训一个具有足够样品的元分解器,并以有限的样本更新模型。此外,我们用目标处理中的梯层下降模型。我们引入了两个模型,从估算个人治疗效果的模型,我们引入了两种估算性实验方法。我们监督了两种实验组的精确度,一种是测测算结果。