Research in the past few decades has discussed the concept of "spatial confounding" but has provided conflicting definitions and proposed solutions, some of which do not address the issue of confounding as it is understood in the field of causal inference. We give a clear account of spatial confounding as the existence of an unmeasured confounding variable with a spatial structure. Under certain conditions, including the smoothness of the confounder as a function of space, we show that spatial covariates (e.g., latitude and longitude) can be handled as typical covariates by algorithms popular in causal inference. We focus on "double machine learning" (DML) by which flexible models are fit for both the exposure and outcome variables to arrive at a causal estimator with favorable convergence properties. These models avoid restrictive assumptions, such as linearity and heterogeneity, which are present in linear models typically employed in spatial statistics and which can lead to strong bias when violated. We demonstrate the advantages of the DML approach analytically and via extensive simulation studies.
翻译:过去几十年的研究讨论了“空间混淆”的概念,但提供了相互矛盾的定义和拟议解决办法,其中一些定义和解决办法没有解决因果推断领域所理解的混淆问题。我们清楚地说明空间混淆是一个与空间结构不测的混杂变量的存在。在某些条件下,包括混杂者作为空间函数的平滑性,我们表明空间共变(如纬度和经度)可以作为典型的共变法处理,在因果推断中很受欢迎。我们注重“双机学习”(DML),根据这种“双机学习”(DML),灵活模型适合暴露和结果变量,从而得出具有有利趋同特性的因果关系估计变量。这些模型避免了限制性假设,如在空间统计中通常采用的线性模型中存在的线性性和异性,在被破坏时可能导致强烈的偏差。我们通过广泛的模拟研究,展示了DML方法的分析优势。