Time series data play an important role in many applications and their analysis reveals crucial information for understanding the underlying processes. Among the many time series learning tasks of great importance, we here focus on semi-supervised learning which benefits of a graph representation of the data. Two main aspects are involved in this task: A suitable distance measure to evaluate the similarities between time series, and a learning method to make predictions based on these distances. However, the relationship between the two aspects has never been studied systematically. We describe four different distance measures, including (Soft) DTW and Matrix Profile, as well as four successful semi-supervised learning methods, including the graph Allen- Cahn method and a Graph Convolutional Neural Network. We then compare the performance of the algorithms on standard data sets. Our findings show that all measures and methods vary strongly in accuracy between data sets and that there is no clear best combination to employ in all cases.
翻译:时间序列数据在许多应用中起着重要作用,它们的分析揭示了了解基本过程的关键信息。在许多非常重要的时间序列学习任务中,我们在此侧重于半监督的学习,这是数据图表显示的好处。这项任务涉及两个主要方面:评估时间序列之间相似之处的适当距离测量,以及根据这些距离作出预测的学习方法。然而,这两个方面之间的关系从未系统地研究过。我们描述了四种不同的距离测量方法,包括(软) DTW和矩阵剖析,以及四种成功的半监督学习方法,包括Allen-Cahn图式方法和一个图表革命神经网络。我们随后比较了标准数据集算法的性能。我们的调查结果显示,所有测量和方法在数据集的准确性方面差异很大,而且在所有情况下都没有明显的最佳组合。