Time series analysis has achieved great success in diverse applications such as network security, environmental monitoring, and medical informatics. Learning similarities among different time series is a crucial problem since it serves as the foundation for downstream analysis such as clustering and anomaly detection. It often remains unclear what kind of distance metric is suitable for similarity learning due to the complex temporal dynamics of the time series generated from event-triggered sensing, which is common in diverse applications, including automated driving, interactive healthcare, and smart home automation. The overarching goal of this paper is to develop an unsupervised learning framework that is capable of learning task-aware similarities among unlabeled event-triggered time series. From the machine learning vantage point, the proposed framework harnesses the power of both hierarchical multi-scale sequence autoencoders and Gaussian Mixture Model (GMM) to effectively learn the low-dimensional representations from the time series. Finally, the obtained similarity measure can be easily visualized for explaining. The proposed framework aspires to offer a stepping stone that gives rise to a systematic approach to model and learn similarities among a multitude of event-triggered time series. Through extensive qualitative and quantitative experiments, it is revealed that the proposed method outperforms state-of-the-art methods considerably.
翻译:时间序列分析在网络安全、环境监测和医疗信息学等各种应用中取得了巨大成功。学习不同时间序列之间的相似性是一个关键问题,因为它是集群和异常探测等下游分析的基础。由于由事件触发的遥感产生的时间序列具有复杂的时间动态,通常仍然不清楚哪种远程计量标准适合类似性学习,这在包括自动化驾驶、交互式保健和智能家庭自动化在内的各种应用中是常见的。本文件的总目标是开发一个不受监督的学习框架,能够学习未标记的事件触发时间序列之间的相似性。从机器学习范式点看,拟议的框架利用了等级多级自动测序和高斯混集模型(GMMM)的力量,以便有效地学习时间序列中低维度的表述。最后,获得的类似性计量方法可以很容易被直观地加以解释。拟议的框架希望提供一个跳板,能够形成一个系统化的模型,并学习大量事件错位时间序列之间的相似性。拟议的定性和定量实验方法大大地揭示了该方法。