Causal discovery identifies causal relationships between variables in a dataset. This study investigates the potential of causal discovery in extracting causal connections from transportation behavioral data. To do so, four causal discovery algorithms are tested: Peter-Clark (PC), Fast Causal Inference (FCI), Fast Greedy Equivalence Search (FGES), and Linear Non-Gaussian Acyclic Models (LiNGAM). Their performances are compared to determine the most appropriate algorithm for travel choice modeling. Next, we propose a novel methodology to combine causal discovery with structural equation modeling (SEM) to model travel mode choice. This modeling approach can overcome some of the limitations of SEM, by combining both the strengths of causal discovery and SEM. The results show that LiNGAM best captures causality in transportation behavior modeling, among the four algorithms tested since the LiNGAM-based SEM achieved the lowest values of Chi-square, Root Mean Square Error of approximation (RMSEA), along with greater than 0.95 Comparative Fit Index (CFI), Goodness-of-Fit Index (GFI), and Adjusted Goodness-of-Fit index (AGFI), Normed Fit Index (NFI), and Tucker-Lewis Index (TLI). The modeling results provide insights in causal relations leading to choosing private vehicles, public transit, or walking as a travel mode. The analyses are conducted on data from the 2017 National Household Travel Survey in the New York Metropolitan area.
翻译:原因发现确定了数据集中变量之间的因果关系。 本研究调查了从运输行为数据中提取因果联系的因果发现潜力。 为此,测试了四种因果发现算法:Peter-Clark(PC)、快速因果推断(FCI)、快速贪婪等价搜索(FGIS)和Linear None-Gausian Acylical 模型(LiNGAMAM),它们的表现被比较为确定旅行选择模型的最合适的算法。接下来,我们提出了一种新方法,将因果发现与结构等式模型(SEM)和模式旅行模式选择相结合。这个模型方法可以克服SEM的一些局限性,将因果发现和SEM的优势结合起来。结果显示,LINGAM最能捕捉到运输行为模型的因果关系,这是自LINGAM(LINGAM)的SEM(Chi-squalferal)以来所测试的四种算法,即“最起码的 " 旅行选择的 " 直观 " (RIMEA) ",以及超过0.95的比较指数(CFI-F-Fet-F-Fet-Leal developheal Inview) Exess real deal deal deal degradustrislational deal deal deal deal)。