Even though observational data contain an enormous number of covariates, the existence of unobserved confounders still cannot be excluded and remains a major barrier to drawing causal inference from observational data. A large-scale propensity score (LSPS) approach may adjust for unobserved confounders by including tens of thousands of available covariates that may be correlated with them. In this paper, we present conditions under which LSPS can remove bias due to unobserved confounders. In addition, we show that LSPS may avoid bias that can be induced when adjusting for various unwanted variables (e.g., M-structure colliders). We demonstrate the performance of LSPS on bias reduction using both simulations and real medical data.
翻译:尽管观测数据包含大量共变数据,但未观察到的困惑者的存在仍然不能排除,仍然是从观测数据中得出因果推论的主要障碍。大规模适应性评分(LSPS)方法可以对未观察到的困惑者进行调整,包括数万种可能与它们相关的可用共变数据。在本文中,我们介绍了LSPS可以消除未观察到的困惑者偏见的条件。此外,我们表明,LSPS可以避免在调整各种不想要的变量(如M结构对撞器)时可能导致的偏见。我们用模拟和真实医学数据来证明LSPS在减少偏见方面的表现。