Over the past few decades, addressing "spatial confounding" has become a major topic in spatial statistics. However, the literature has provided conflicting definitions, and many proposed solutions do not address the issue of confounding as it is understood in causal inference. We give a clear account of spatial confounding as the existence of an unmeasured confounding variable with a spatial structure. Under certain conditions, including the measurability of the confounder as a function of space, we show that spatial covariates (e.g. latitude and longitude) can be handled by existing causal inference estimation procedures. We focus on "double machine learning" (DML), a procedure in which flexible models are used to regress both the exposure and outcome variables on confounders to arrive at a causal estimator with favorable robustness properties and convergence rates. These models avoid restrictive assumptions, such as linearity and effect homogeneity, which are typically made in spatial models and which can lead to bias when violated. We demonstrate the advantages of the DML approach analytically and via extensive simulation studies. We apply our methods and reasoning to a study of the effect of fine particulate matter exposure during pregnancy on birthweight in California.
翻译:在过去几十年中,解决“空间混乱”已成为空间统计中的一个主要议题,然而,文献提供了相互矛盾的定义,许多拟议解决办法没有解决因果推断中理解的混淆问题。我们清楚地说明空间混乱与空间结构存在一个无法测量的混杂变量,在某些情况下,包括相融合者作为空间功能的可衡量性,我们表明空间的共变(如纬度和经度)可以通过现有的因果推断估计程序处理。我们注重“双机学习”(DML),这是一个使用灵活模型来逆转接触和结果变量的程序,以便让相融合者得出一个具有有利稳健性和趋同率的因果关系估计因素。这些模型避免了限制性假设,例如线性和同一性,这种假设通常是在空间模型中作出的,一旦被破坏,可能会导致偏差。我们通过广泛的模拟研究,展示DML方法的优势。我们运用灵活模型来回移接触风险和结果变量,以回归风险风险和结果变量,从而得出一个具有有利稳健性和趋同率的因果关系。我们用方法和推理方法,在加利福尼亚州对妊娠期的微粒质接触效果进行研究。