Temporal data, representing chronological observations of complex systems, has always been a typical data structure that can be widely generated by many domains, such as industry, medicine and finance. Analyzing this type of data is extremely valuable for various applications. Thus, different temporal data analysis tasks, eg, classification, clustering and prediction, have been proposed in the past decades. Among them, causal discovery, learning the causal relations from temporal data, is considered an interesting yet critical task and has attracted much research attention. Existing casual discovery works can be divided into two highly correlated categories according to whether the temporal data is calibrated, ie, multivariate time series casual discovery, and event sequence casual discovery. However, most previous surveys are only focused on the time series casual discovery and ignore the second category. In this paper, we specify the correlation between the two categories and provide a systematical overview of existing solutions. Furthermore, we provide public datasets, evaluation metrics and new perspectives for temporal data casual discovery.
翻译:时序数据的因果发现:综述与新视角
时间数据代表了复杂系统的按时间顺序观察记录,是很多领域(例如工业、医疗和金融)广泛生成的类型数据结构。分析这种数据对各种应用非常有价值。因此,过去几十年中提出了不同的时间数据分析任务,例如分类、聚类和预测。其中,从时间数据中学习因果关系的因果发现是一个有趣但也很关键的任务,受到了很多研究的关注。现有的因果发现工作可以根据时间数据是否为加工过的数据分成两个高度相关的类别,即多元时间序列因果发现和事件序列因果发现。然而,大多数先前的调查只关注时间序列因果发现,并忽略了第二类别。在本文中,我们指定了这两种类别之间的相关性,并对现有解决方案进行了系统的概述。此外,我们提供公共数据集、评估指标和时序数据因果发现的新视角。