Discovering causal relationships between different variables from time series data has been a long-standing challenge for many domains such as climate science, finance, and healthcare. Given the complexity of real-world relationships and the nature of observations in discrete time, causal discovery methods need to consider non-linear relations between variables, instantaneous effects and history-dependent noise (the change of noise distribution due to past actions). However, previous works do not offer a solution addressing all these problems together. In this paper, we propose a novel causal relationship learning framework for time-series data, called Rhino, which combines vector auto-regression, deep learning and variational inference to model non-linear relationships with instantaneous effects while allowing the noise distribution to be modulated by historical observations. Theoretically, we prove the structural identifiability of Rhino. Our empirical results from extensive synthetic experiments and two real-world benchmarks demonstrate better discovery performance compared to relevant baselines, with ablation studies revealing its robustness under model misspecification.
翻译:从时间序列数据中发现不同变量之间的因果关系对气候科学、金融和医疗保健等许多领域来说是一个长期的挑战。鉴于现实世界关系的复杂性和在离散时间进行观测的性质,因果发现方法需要考虑变量、瞬时效应和历史噪音之间的非线性关系(由于过去的行动,噪音分布的变化)。然而,以前的工作并不能提供共同解决所有这些问题的解决办法。在本文件中,我们提议为时间序列数据建立一个新的因果关系学习框架,称为Rhino,它将矢量自动反射、深度学习和变异推论结合起来,以模拟具有瞬时效应的非线性关系,同时允许历史观察对噪音分布进行调节。理论上,我们证明了Rhino的结构特征。我们从广泛的合成实验和两个真实世界基准中得出的实验结果与相关基线相比,显示了更好的发现性表现。在模型误差下,我们进行了实验研究,揭示了它的强健性。