Understanding how treatment effects vary on individual characteristics is critical in the contexts of personalized medicine, personalized advertising and policy design. When the characteristics are of practical interest are only a subset of full covariate, non-parametric estimation is often desirable; but few methods are available due to the computational difficult. Existing non-parametric methods such as the inverse probability weighting methods have limitations that hinder their use in many practical settings where the values of propensity scores are close to 0 or 1. We propose the propensity score regression (PSR) that allows the non-parametric estimation of the heterogeneous treatment effects in a wide context. PSR includes two non-parametric regressions in turn, where it first regresses on the propensity scores together with the characteristics of interest, to obtain an intermediate estimate; and then, regress the intermediate estimates on the characteristics of interest only. By including propensity scores as regressors in the non-parametric manner, PSR is capable of substantially easing the computational difficulty while remain (locally) insensitive to any value of propensity scores. We present several appealing properties of PSR, including the consistency and asymptotical normality, and in particular the existence of an explicit variance estimator, from which the analytical behaviour of PSR and its precision can be assessed. Simulation studies indicate that PSR outperform existing methods in varying settings with extreme values of propensity scores. We apply our method to the national 2009 flu survey (NHFS) data to investigate the effects of seasonal influenza vaccination and having paid sick leave across different age groups.
翻译:理解个体特征上的治疗效应如何变化在个性化医学、个性化广告和政策设计的背景下至关重要。当特征只是一些全面协变量的基本兴趣时,非参数估计通常是可取的。但是,由于计算困难,很少有方法可用。现有的非参数方法(如倒数概率加权法)存在限制,防碍了它们在许多实际环境中的使用,特别是在倾向性得分的值接近0或1时。我们提出了倾向性得分回归(PSR),在广泛的情况下允许非参数估计异质性治疗效应。PSR依次包括两个非参数回归,其中它首先回归得分倾向性以及感兴趣的特征,以获得中间估计;然后,将中间估计仅回归到感兴趣的特征上。通过以非参数方式包括倾向性得分的回归器,PSR能够极大地减轻计算困难,同时保持对倾向得分的任何值(局部)不敏感。我们提出了PSR的几个吸引人的属性,包括一致性和渐近正常性,特别是存在一个明确的方差估计量,从中可以评估PSR的分析行为和精度。仿真研究表明,在具有极端倾向性得分值的不同情况下,PSR优于现有方法。我们将我们的方法应用于全国2009年流感调查(NHFS)数据,以研究不同年龄组的季节性流感疫苗接种和带薪病假的效应。