Temporal data, representing chronological observations of complex systems, has always been a typical data structure that can be widely generated by many domains, such as industry, medicine and finance. Analyzing this type of data is extremely valuable for various applications. Thus, different temporal data analysis tasks, eg, classification, clustering and prediction, have been proposed in the past decades. Among them, causal discovery, learning the causal relations from temporal data, is considered an interesting yet critical task and has attracted much research attention. Existing casual discovery works can be divided into two highly correlated categories according to whether the temporal data is calibrated, ie, multivariate time series casual discovery, and event sequence casual discovery. However, most previous surveys are only focused on the time series casual discovery and ignore the second category. In this paper, we specify the correlation between the two categories and provide a systematical overview of existing solutions. Furthermore, we provide public datasets, evaluation metrics and new perspectives for temporal data casual discovery.
翻译:时序数据表示复杂系统的时间观测,一直以来都是许多领域(如工业、医学和金融)广泛生成的典型数据结构。分析这类数据对于各种应用非常有价值,因此,过去几十年间提出了不同的时序数据分析任务,例如分类、聚类和预测。其中,从时序数据中学习因果关系的因果发现任务被认为是一个有趣但至关重要的任务,吸引了众多研究关注。现有的因果发现工作可以根据时序数据是否被校准分为两个高度关联的类别,即多元时间序列因果发现和事件序列因果发现。然而,大多数以前的综述只关注于第一类(多元时间序列因果发现)而忽略了第二类。本文特别说明了这两类之间的相关性,并提供了现有解决方案的系统综述。此外,我们提供了公共数据集、评估指标和时序数据因果发现的新视角。