Given a multivariate big time series, can we detect anomalies as soon as they occur? Many existing works detect anomalies by learning how much a time series deviates away from what it should be in the reconstruction framework. However, most models have to cut the big time series into small pieces empirically since optimization algorithms cannot afford such a long series. The question is raised: do such cuts pollute the inherent semantic segments, like incorrect punctuation in sentences? Therefore, we propose a reconstruction-based anomaly detection method, MissGAN, iteratively learning to decode and encode naturally smooth time series in coarse segments, and finding out a finer segment from low-dimensional representations based on HMM. As a result, learning from multi-scale segments, MissGAN can reconstruct a meaningful and robust time series, with the help of adversarial regularization and extra conditional states. MissGAN does not need labels or only needs labels of normal instances, making it widely applicable. Experiments on industrial datasets of real water network sensors show our MissGAN outperforms the baselines with scalability. Besides, we use a case study on the CMU Motion dataset to demonstrate that our model can well distinguish unexpected gestures from a given conditional motion.
翻译:在一个多变的大型时间序列中, 我们能否在异常现象发生时立即发现异常现象? 许多现有工作通过了解时间序列与重建框架中应该有的差别来检测异常现象。 然而, 大多数模型必须从经验上将大时间序列切成小块, 因为优化算法无法支付如此长的序列。 问题在于: 这样的削减是否污染了内在的语义部分, 比如不正确的句号标出? 因此, 我们建议一种基于重建的异常现象检测方法, MissGAN, 反复学习在粗略的段解码和编码自然平稳的时间序列, 从基于 HMMM 的低维度表达中找到一个更精细的部分。 结果, 从多尺度部分中学习, MissGAN 可以重建一个有意义和稳健的时间序列, 帮助进行对抗性调节和附加条件状态 。 MissGAN 不需要标签或仅需要普通例子标签, 使其广泛适用 。 在实际水网络传感器的工业数据集上进行实验, 显示我们的MissGAN 超越了基线的可缩略性。 此外, 我们用一个无法理解的模型来辨别我们的CMUP 。