Classification of sequences of temporal intervals is a part of time series analysis which concerns series of events. We propose a new method of transforming the problem to a task of multivariate series classification. We use one of the state-of-the-art algorithms from the latter domain on the new representation to obtain significantly better accuracy than the state-of-the-art methods from the former field. We discuss limitations of this workflow and address them by developing a novel method for classification termed COSTI (short for Classification of Sequences of Temporal Intervals) operating directly on sequences of temporal intervals. The proposed method remains at a high level of accuracy and obtains better performance while avoiding shortcomings connected to operating on transformed data. We propose a generalized version of the problem of classification of temporal intervals, where each event is supplemented with information about its intensity. We also provide two new data sets where this information is of substantial value.
翻译:时间间隔序列的分类是时间序列分析的一部分,涉及一系列事件。我们提出了将问题转化为多变序列分类任务的新方法。我们使用新表述中后一个领域最先进的算法,以获得比前一个领域最先进的方法更好的准确性。我们讨论了这一工作流程的局限性,并通过开发一种新颖的分类方法来解决这些局限性,即直接按时间间隔顺序运作的COSTI(时间间隔序列分类短),拟议方法保持高度准确性,并取得更好的性能,同时避免与转换数据运行有关的缺陷。我们提出了时间间隔分类问题的一般版本,其中每项活动都以其强度信息作为补充。我们还在这种信息具有重大价值的地方提供了两个新的数据集。