Nonparametric estimators, such as the augmented inverse probability weighted (AIPW) estimator, have become increasingly popular in causal inference. Numerous nonparametric estimators have been proposed, but they are all asymptotically normal with the same asymptotic variance under similar conditions, leaving little guidance for practitioners to choose an estimator. In this paper, I focus on another important perspective of their asymptotic behaviors beyond asymptotic normality, the convergence of the Wald-confidence interval (CI) coverage to the nominal coverage. Such results have been established for simpler estimators (e.g., the Berry-Esseen Theorem), but are lacking for nonparametric estimators. I consider a simple but practical setting where the AIPW estimator based on a black-box nuisance estimator, with or without cross-fitting, is used to estimate the average treatment effect in randomized controlled trials. I derive non-asymptotic Berry-Esseen-type bounds on the difference between Wald-CI coverage and the nominal coverage. I also analyze the bias of variance estimators, showing that the cross-fit variance estimator might overestimate while the non-cross-fit variance estimator might underestimate, which might explain why cross-fitting has been empirically observed to improve Wald-CI coverage.
翻译:在因果推断中,非参数估计量,如增强逆概率加权(AIPW)估计量,已变得越来越流行。尽管已提出了许多非参数估计量,但在类似条件下它们都具有相同的渐近正态性和渐近方差,这为实践者选择估计量提供的指导甚少。在本文中,我关注其渐近行为中除渐近正态性之外的另一个重要视角,即Wald置信区间(CI)覆盖率向名义覆盖率的收敛。此类结果已在较简单的估计量(例如,Berry-Esseen定理)中得到确立,但对于非参数估计量尚属缺乏。我考虑一个简单但实用的设定:在随机对照试验中,使用基于黑盒干扰估计量(无论是否采用交叉拟合)的AIPW估计量来估计平均处理效应。我推导了关于Wald-CI覆盖率与名义覆盖率之间差异的非渐近Berry-Esseen型界。我还分析了方差估计量的偏差,表明交叉拟合方差估计量可能高估方差,而非交叉拟合方差估计量可能低估方差,这或许可以解释为何经验上观察到交叉拟合能改善Wald-CI覆盖率。