Time series research has gathered lots of interests in the last decade, especially for Time Series Classification (TSC) and Time Series Forecasting (TSF). Research in TSC has greatly benefited from the University of California Riverside and University of East Anglia (UCR/UEA) Time Series Archives. On the other hand, the advancement in Time Series Forecasting relies on time series forecasting competitions such as the Makridakis competitions, NN3 and NN5 Neural Network competitions, and a few Kaggle competitions. Each year, thousands of papers proposing new algorithms for TSC and TSF have utilized these benchmarking archives. These algorithms are designed for these specific problems, but may not be useful for tasks such as predicting the heart rate of a person using photoplethysmogram (PPG) and accelerometer data. We refer to this problem as Time Series Extrinsic Regression (TSER), where we are interested in a more general methodology of predicting a single continuous value, from univariate or multivariate time series. This prediction can be from the same time series or not directly related to the predictor time series and does not necessarily need to be a future value or depend heavily on recent values. To the best of our knowledge, research into TSER has received much less attention in the time series research community and there are no models developed for general time series extrinsic regression problems. Most models are developed for a specific problem. Therefore, we aim to motivate and support the research into TSER by introducing the first TSER benchmarking archive. This archive contains 19 datasets from different domains, with varying number of dimensions, unequal length dimensions, and missing values. In this paper, we introduce the datasets in this archive and did an initial benchmark on existing models.
翻译:在过去十年里,时间序列的研究收集了许多兴趣,特别是在时间序列分类(TSC)和时间序列预测(TSF)方面。在TSC的研究极大地受益于加利福尼亚河滨大学和东安吉利亚大学(UCR/UEA)时间序列档案。另一方面,时间序列预测的进展依赖于时间序列预测竞赛,如马克里达基斯竞赛、NNN3和NNN5神经网络竞赛,以及一些卡格格勒竞赛。每年,数千份提议TSC和TSF新算法的文件都利用了这些基准评估档案。这些算法是针对这些具体问题设计的,但对于预测一个人的心率(UCRPG/UEA)时间序列(UCR)和时间序列数据等任务可能没有多大用处。我们提到这个问题,比如时间序列的缺分数,Extrinci Regresionionion(TSER),我们感兴趣的是预测单一连续值的比较方法,从未读或多变数时间序列开始。这个预测可能来自同一时间序列或远值的SER数据序列, 也不一定直接地需要预测一个不同的时间序列。