The estimation of Average Treatment Effect (ATE) as a causal parameter is carried out in two steps, wherein the first step, the treatment, and outcome are modeled to incorporate the potential confounders, and in the second step, the predictions are inserted into the ATE estimators such as the Augmented Inverse Probability Weighting (AIPW) estimator. Due to the concerns regarding the nonlinear or unknown relationships between confounders and the treatment and outcome, there has been an interest in applying non-parametric methods such as Machine Learning (ML) algorithms instead. \cite{farrell2018deep} proposed to use two separate Neural Networks (NNs) where there's no regularization on the network's parameters except the Stochastic Gradient Descent (SGD) in the NN's optimization. Our simulations indicate that the AIPW estimator suffers extensively if no regularization is utilized. We propose the normalization of AIPW (referred to as nAIPW) which can be helpful in some scenarios. nAIPW, provably, has the same properties as AIPW, that is double-robustness and orthogonality \citep{chernozhukov2018double}. Further, if the first step algorithms converge fast enough, under regulatory conditions \citep{chernozhukov2018double}, nAIPW will be asymptotically normal.
翻译:将平均治疗效果(ATE)估计为因果参数是分两个步骤进行的,第一步、处理和结果模型化以纳入潜在的混杂者,第二步,预测被插入ATE估计值中,例如,反向概率加权(AIPW)估算值。由于对混淆者与治疗和结果之间非线性或未知关系的关切,人们有兴趣采用非参数方法,如机器学习(ML)算法。\cite{farrell2018dep}提议使用两个独立的神经网络(NNS),网络参数没有正规化, NNN的优化中除Stochatical Egradient Spedient Speople (SGD) 。我们的模拟表明,AIPWadormority(称为nAIPW)如果没有正规化,那么AIPW(称为nAIPW)将在某些情景中有所帮助。 nIPW, rbrocialalality, 和rblental-rmalationality, 将具有相同的特性。