Structural Equation Modeling (SEM) has gained popularity in the social sciences and causal inference due to its flexibility in modeling complex relationships between variables and its availability in modern statistical software. To move beyond the parametric assumptions of SEM, this paper reviews targeted maximum likelihood estimation (TMLE), a doubly robust, machine learning-based approach that builds on nonparametric SEM. We demonstrate that both TMLE and SEM can be used to estimate standard causal effects and show that TMLE is robust to model misspecification. We conducted simulation studies under both correct and misspecified model conditions, implementing SEM and TMLE to estimate these causal effects. The simulations confirm that TMLE consistently outperforms SEM under misspecification in terms of bias, mean squared error, and the validity of confidence intervals. We applied both approaches to a real-world dataset to analyze the mediation effects of poverty on access to high school, revealing that the direct effect is no longer significant under TMLE, whereas SEM indicates significance. We conclude with practical guidance on using SEM and TMLE in light of recent developments in targeted learning for causal inference.
翻译:结构方程模型(SEM)因其在变量间复杂关系建模方面的灵活性及在现代统计软件中的广泛可用性,在社会科学和因果推断领域日益普及。为超越SEM的参数化假设,本文综述了目标最大似然估计(TMLE),这是一种基于非参数SEM的双重稳健、机器学习驱动的方法。我们证明TMLE和SEM均可用于估计标准因果效应,并表明TMLE对模型设定错误具有稳健性。我们在正确设定和错误设定模型条件下进行了模拟研究,分别采用SEM和TMLE估计这些因果效应。模拟结果证实,在模型设定错误时,TMLE在偏差、均方误差和置信区间有效性方面均持续优于SEM。我们将两种方法应用于真实数据集,分析贫困对高中入学机会的中介效应,发现TMLE下直接效应不再显著,而SEM则显示显著。最后,结合因果推断中目标学习的最新进展,我们提供了使用SEM和TMLE的实践指导。