In the study of causal inference, statisticians show growing interest in estimating and analyzing heterogeneity in causal effects in observational studies. However, there usually exists a trade-off between accuracy and interpretability when developing a desirable estimator for treatment effects. To make efforts to address the issue, we propose a non-parametric framework for estimating the Conditional Average Treatment Effect (CATE) function in this paper. The framework integrates two components: (i) leverage the joint use of propensity and prognostic scores in a matching algorithm to obtain a proxy of the heterogeneous treatment effects for each observation, (ii) utilize non-parametric regression trees to construct an estimator for the CATE function conditioning on the two scores. The method naturally stratifies treatment effects into subgroups over a 2d grid whose axis are the propensity and prognostic scores. We conduct benchmark experiments on multiple simulated data and demonstrate clear advantages of the proposed estimator over state of the art methods. We also evaluate empirical performance in real-life settings, using two observational social studies in the United States, and interpret policy implications following the numerical results.
翻译:在因果推断研究中,统计人员表示对估计和分析观察研究中因果效应的异质性的兴趣越来越大,但是,在为治疗效果制定理想估计标准时,通常在准确性和可解释性之间有一个权衡。为了努力解决这一问题,我们提议了一个非参数框架来估计本文件中的条件平均治疗效果(CATE)的功能。框架包括两个组成部分:(一) 利用在匹配算法中共同使用适应性和预测性分数的方法,以获得每种观测结果的不同处理效果的替代物;(二) 利用非参数回归树来为CATE函数设置一个根据两个分数调整的估测器。这种方法自然地将治疗效果分到轴轴为敏性和预测分数的2D格上的子组。我们对多个模拟数据进行基准试验,并表明拟议估计者对艺术方法现状的明显优势。我们还利用美国的两项观察社会研究,评估实际生活环境中的经验表现,并解释在数字结果之后的政策影响。