The estimation of Average Treatment Effect (ATE) as a causal parameter is carried out in two steps, where in the first step, the treatment and outcome are modeled to incorporate the potential confounders, and in the second step, the predictions are inserted into the ATE estimators such as the Augmented Inverse Probability Weighting (AIPW) estimator. Due to the concerns regarding the nonlinear or unknown relationships between confounders and the treatment and outcome, there has been an interest in applying non-parametric methods such as Machine Learning (ML) algorithms instead. Some literature proposes to use two separate Neural Networks (NNs) where there's no regularization on the network's parameters except the Stochastic Gradient Descent (SGD) in the NN's optimization. Our simulations indicate that the AIPW estimator suffers extensively if no regularization is utilized. We propose the normalization of AIPW (referred to as nAIPW) which can be helpful in some scenarios. nAIPW, provably, has the same properties as AIPW, that is, the double-robustness and orthogonality properties. Further, if the first step algorithms converge fast enough, under regulatory conditions, nAIPW will be asymptotically normal. We also compare the performance of AIPW and nAIPW in terms of the bias and variance when small to moderate L1 regularization is imposed on the NNs.
 翻译:将平均治疗效果(ATE)估计为因果参数是按两个步骤进行的,第一步是模拟处理和结果,以纳入潜在的混杂者,第二步是将预测插入ATE估计值,例如反向概率加权估算值(AIPW)估计值。由于对混淆者与治疗和结果之间非线性或未知关系的关切,人们有兴趣采用非参数方法,例如机器学习算法。有些文献提议使用两个单独的神经网络,网络参数没有正规化,只有NNNW优化中的存储梯度梯度源(SGD)除外。我们的模拟表明,如果没有利用非线性关系以及处理和结果,则AIPW的估算值会大受影响。我们提议将AIPW(称为nAIPW)的正常化(称为nAIPW)在某些情景中有所帮助。可以肯定的是,在AIPW的常规性参数上具有相同的两个特性,如果监管性能水平下具有足够稳定性,那么在AIPIP值下也具有相同的特性。