Causal phenomena associated with rare events frequently occur across a wide range of engineering and mathematical problems, such as risk-sensitive safety analysis, accident analysis and prevention, and extreme value theory. However, current methods for causal discovery are often unable to uncover causal links between random variables that manifest only when the variables first experience low-probability realizations. To address this issue, we introduce a novel algorithm that performs statistical independence tests on data collected from time-invariant dynamical systems in which rare but consequential events occur. We seek to understand if the state of the dynamical system causally affects the likelihood of the rare event. In particular, we exploit the time-invariance of the underlying data to superimpose the occurrences of rare events, thus creating a new dataset, with rare events are better represented, on which conditional independence tests can be more efficiently performed. We provide non-asymptotic bounds for the consistency of our algorithm, and validate the performance of our algorithm across various simulated scenarios, with applications to traffic accidents.
翻译:与稀有事件相关的原因现象经常发生于一系列广泛的工程和数学问题,如风险敏感安全分析、事故分析和预防以及极端价值理论。然而,目前的因果发现方法往往无法发现随机变量之间的因果关系,这些变量只有在变量首先经历低概率认识时才会显现出来。为解决这一问题,我们引入了一种新型算法,对从时而异的动态系统收集的数据进行统计独立测试,在这些数据中发生罕见但附带事件。我们试图了解动态系统的状况是否因果地影响稀有事件的可能性。特别是,我们利用基本数据的时间变化来取代稀有事件发生的可能性,从而创建了新的数据集,以更少的事件为代表,在此基础上可以更有效地进行有条件的独立测试。我们为我们的算法的一致性提供了非无症状的界限,并验证了我们在各种模拟情况下的算法绩效,并应用交通事故。