In modeling multivariate time series for either forecast or policy analysis, it would be beneficial to have figured out the cause-effect relations within the data. Regression analysis, however, is generally for correlation relation, and very few researches have focused on variance analysis for causality discovery. We first set up an equilibrium for the cause-effect relations using a fictitious vector autoregressive model. In the equilibrium, long-run relations are identified from noise, and spurious ones are negligibly close to zero. The solution, called causality distribution, measures the relative strength causing the movement of all series or specific affected ones. If a group of exogenous data affects the others but not vice versa, then, in theory, the causality distribution for other variables is necessarily zero. The hypothesis test of zero causality is the rule to decide a variable is endogenous or not. Our new approach has high accuracy in identifying the true cause-effect relations among the data in the simulation studies. We also apply the approach to estimating the causal factors' contribution to climate change.
翻译:在模拟预测或政策分析的多变时间序列时,如果能找出数据内因果关系,将是有益的。但回溯分析通常是为了关联关系,很少有研究侧重于因果关系发现的差异分析。我们首先利用一个虚拟矢量自动递减模型为因果关系建立平衡。在平衡中,从噪音中找出长期关系,而虚假关系则明显接近于零。解决办法称为因果关系分布,衡量导致所有系列或特定受影响数据流动的相对强度。如果一组外源数据影响其他数据,但反之亦然,则从理论上讲,其他变量的因果关系分布必然为零。假设的因果关系测试是决定变量的规则是内在的,还是非内在的。我们的新办法在确定模拟研究中的数据之间真正的因果关系方面非常精确。我们还采用估计因果因素对气候变化的贡献的方法。