The continued digitization of societal processes translates into a proliferation of time series data that cover applications such as fraud detection, intrusion detection, and energy management, where anomaly detection is often essential to enable reliability and safety. Many recent studies target anomaly detection for time series data. Indeed, area of time series anomaly detection is characterized by diverse data, methods, and evaluation strategies, and comparisons in existing studies consider only part of this diversity, which makes it difficult to select the best method for a particular problem setting. To address this shortcoming, we introduce taxonomies for data, methods, and evaluation strategies, provide a comprehensive overview of unsupervised time series anomaly detection using the taxonomies, and systematically evaluate and compare state-of-the-art traditional as well as deep learning techniques. In the empirical study using nine publicly available datasets, we apply the most commonly-used performance evaluation metrics to typical methods under a fair implementation standard. Based on the structuring offered by the taxonomies, we report on empirical studies and provide guidelines, in the form of comparative tables, for choosing the methods most suitable for particular application settings. Finally, we propose research directions for this dynamic field.
翻译:社会进程的持续数字化转化为时间序列数据的激增,这些数据涵盖欺诈检测、入侵检测和能源管理等应用,而异常点检测往往是可靠和安全的关键。最近的许多研究都以时间序列数据的异常点检测为目标。事实上,时间序列异常点检测的特点是数据、方法和评估战略多种多样,现有研究的比较仅考虑到这种多样性的一部分,这使得难以为特定问题设置选择最佳方法。为了解决这一缺陷,我们为数据、方法和评估战略引入了分类,全面概述了使用分类法对未经监督的时间序列异常点的检测,系统评估和比较了最新传统和深层学习技术。在使用9个公开数据集进行的经验研究中,我们将最常用的业绩评估指标应用于公平执行标准下的典型方法。根据分类法提供的结构,我们报告经验研究,并以比较表的形式提供指南,用于选择最适合特定应用环境的方法。最后,我们提出这一动态领域的研究方向。