Several recently developed methods have the potential to harness machine learning in the pursuit of target quantities inspired by causal inference, including inverse weighting, doubly robust estimating equations and substitution estimators like targeted maximum likelihood estimation. There are even more recent augmentations of these procedures that can increase robustness, by adding a layer of cross-validation (cross-validated targeted maximum likelihood estimation and double machine learning, as applied to substitution and estimating equation approaches, respectively). While these methods have been evaluated individually on simulated and experimental data sets, a comprehensive analysis of their performance across ``real-world'' simulations have yet to be conducted. In this work, we benchmark multiple widely used methods for estimation of the average treatment effect using ten different nutrition intervention studies data. A realistic set of simulations, based on a novel method, highly adaptive lasso, for estimating the data-generating distribution that guarantees a certain level of complexity (undersmoothing) is used to better mimic the complexity of the true data-generating distribution. We have applied this novel method for estimating the data-generating distribution by individual study and to subsequently use these fits to simulate data and estimate treatment effects parameters as well as their standard errors and resulting confidence intervals. Based on the analytic results, a general recommendation is put forth for use of the cross-validated variants of both substitution and estimating equation estimators. We conclude that the additional layer of cross-validation helps in avoiding unintentional over-fitting of nuisance parameter functionals and leads to more robust inferences.
翻译:最近开发的几种方法有可能利用机器学习来追求因果推断所激发的目标数量,包括反加权、加倍强的估计方程式和替代功能估计器,如有针对性的最大概率估计等。最近,这些程序的扩大可以提高稳健性,方法是增加一个交叉校验层(分别用于替代和估计方程方法的交叉校准目标最大概率估计和双机学习,分别用于替代和估计方程方法)。这些方法在模拟和实验数据集上分别进行了评估,但在“现实世界”模拟中,尚未对其业绩进行全面分析。在这项工作中,我们用十种不同的营养干预研究数据为估算平均治疗效果的多种广泛使用的方法设定基准。一套现实的模拟,根据一种新颖的方法,高度适应性拉索,用于估算数据生成的分布,从而保证某种程度的复杂性(偏差),用来更好地模拟真实数据生成值分布的复杂性。我们运用了这种新颖的方法来评估数据生成的分布,在单独研究中,并在随后使用这些对等值的精确度的精确度评估结果中,从而推导出一个基础的模拟数据和排序结果。