Very large time series are increasingly available from an ever wider range of IoT-enabled sensors, from which significant insights can be obtained through mining temporal patterns from them. A useful type of patterns found in many real-world applications exhibits periodic occurrences, and is thus called seasonal temporal pattern (STP). Compared to regular patterns, mining seasonal temporal patterns is more challenging since traditional measures such as support and confidence do not capture the seasonality characteristics. Further, the anti-monotonicity property does not hold for STPs, and thus, resulting in an exponential search space. This paper presents our Frequent Seasonal Temporal Pattern Mining from Time Series (FreqSTPfTS) solution providing: (1) The first solution for seasonal temporal pattern mining (STPM) from time series that can mine STP at different data granularities. (2) The STPM algorithm that uses efficient data structures and two pruning techniques to reduce the search space and speed up the mining process. (3) An approximate version of STPM that uses mutual information, a measure of data correlation, to prune unpromising time series from the search space. (4) An extensive experimental evaluation showing that STPM outperforms the baseline in runtime and memory consumption, and can scale to big datasets. The approximate STPM is up to an order of magnitude faster and less memory consuming than the baseline, while maintaining high accuracy.
翻译:大量的时间序列越来越多地来自范围日益扩大的由IoT驱动的传感器,从中可以从这些传感器的采矿时间模式中获得重要的洞察力。在许多实际应用中发现的一种有用的模式类型定期发生,因此被称为季节性时间模式。与常规模式相比,采矿季节性时间模式更具挑战性,因为支持和信任等传统措施不能捕捉季节性特征。此外,抗热性财产对于STTP来说并不具有,从而导致一个指数搜索空间。本文展示了我们从时间序列(FreqSTPfTS)中常见的季节性时空模式采矿(FreqSTPfs)解决方案,提供:(1) 时间序列中季节性时间模式采矿(STPM)的第一个解决办法是在不同的数据粒子上进行采矿。(2) 使用高效的数据结构和两种调整技术来减少搜索空间并加快采矿进程。(3) 使用相互信息、数据相关性度的STPMM的近似版本,以便从搜索空间提取时间序列(FreqSTPTS)提供:(1) 广泛的实验性评估,表明STPM的存储率比标准要快得多,而在基线上比高的存储和消费的高度。