In randomized experiments, adjusting for observed features when estimating treatment effects has been proposed as a way to improve asymptotic efficiency. However, only linear regression has been proven to form an estimate of the average treatment effect that is asymptotically no less efficient than the treated-minus-control difference in means regardless of the true data generating process. Randomized treatment assignment provides this "do-no-harm" property, with neither truth of a linear model nor a generative model for the outcomes being required. We present a general calibration method which confers the same no-harm property onto estimators leveraging a broad class of nonlinear models. This recovers the usual regression-adjusted estimator when ordinary least squares is used, and further provides non-inferior treatment effect estimators using methods such as logistic and Poisson regression. The resulting estimators are non-inferior to both the difference in means estimator and to treatment effect estimators that have not undergone calibration. We show that our estimator is asymptotically equivalent to an inverse probability weighted estimator using a logit link with predicted potential outcomes as covariates. In a simulation study, we demonstrate that common nonlinear estimators without our calibration procedure may perform markedly worse than both the calibrated estimator and the unadjusted difference in means.
翻译:在随机的实验中,在估计治疗效果时,建议调整观察到的特征,以此作为提高无线反应效率的一种方法。然而,只有线性回归被证明能够形成平均治疗效果的估计数,这种平均治疗效果的随机性效率不亚于处理-最小控制能力的差异,而不论真正的数据生成过程如何。随机性治疗任务提供了这种“不伤害”属性,既没有线性模型的真伪,也没有要求的结果的基因化模型。我们提出了一个一般校准方法,该方法将相同的无伤害属性授予利用大类非线性模型的估测者。在使用普通最小方时,这恢复了通常的回归-调整估计效果,并且进一步提供了非偏差治疗效果的估测者,使用了后勤和Poisson回归等方法。 由此产生的估测者对于数值的差别,既非直线模型的真伪性,也不适用于没有经过校准的估测结果。 我们的估测师与不偏差性偏差的校准性校准结果相比, 我们的测结果与不具有更差性的校准性的校准结果。