现行时间序列反常探测基准被破坏,正在创造进步的幻觉 (Current Time Series Anomaly Detection Benchmarks are Flawed and are Creating the Illusion of Progress)

Time series anomaly detection has been a perennially important topic in data science, with papers dating back to the 1950s. However, in recent years there has been an explosion of interest in this topic, much of it driven by the success of deep learning in other domains and for other time series tasks. Most of these papers test on one or more of a handful of popular benchmark datasets, created by Yahoo, Numenta, NASA, etc. In this work we make a surprising claim. The majority of the individual exemplars in these datasets suffer from one or more of four flaws. Because of these four flaws, we believe that many published comparisons of anomaly detection algorithms may be unreliable, and more importantly, much of the apparent progress in recent years may be illusionary. In addition to demonstrating these claims, with this paper we introduce the UCR Time Series Anomaly Archive. We believe that this resource will perform a similar role as the UCR Time Series Classification Archive, by providing the community with a benchmark that allows meaningful comparisons between approaches and a meaningful gauge of overall progress.

翻译：时间序列异常现象探测是数据科学中一个常年重要话题,论文可追溯到1950年代。然而,近年来,人们对这个话题的兴趣大增,其中很大一部分是由于其他领域和其他时间序列任务中深层学习的成功所推动的。这些论文大多是对亚胡、努门塔、美国航天局等亚虎、努门塔、美国航天局制作的少数流行基准数据集中的一个或数个进行测试。在这项工作中,我们提出一个令人惊讶的要求。这些数据集中的大多数个人模型都存在一个或四个以上的缺陷。由于这四个缺陷,我们认为,许多公布的异常检测算法比较可能不可靠,更重要的是,近年来许多明显的进展可能是幻想性的。除了展示这些说法外,我们介绍UCR Time系列阿诺玛利档案。我们相信,这一资源将发挥类似于UCR Time系列分类档案的类似作用,为社区提供一个基准,使其能够对各种方法进行有意义的比较,并对总体进展进行有意义的衡量。

相关内容

异常检测

关注 102

在数据挖掘中，异常检测（英语：anomaly detection）对不符合预期模式或数据集中其他项目的项目、事件或观测值的识别。通常异常项目会转变成银行欺诈、结构缺陷、医疗问题、文本错误等类型的问题。异常也被称为离群值、新奇、噪声、偏差和例外。特别是在检测滥用与网络入侵时，有趣性对象往往不是罕见对象，但却是超出预料的突发活动。这种模式不遵循通常统计定义中把异常点看作是罕见对象，于是许多异常检测方法（特别是无监督的方法）将对此类数据失效，除非进行了合适的聚集。相反，聚类分析算法可能可以检测出这些模式形成的微聚类。有三大类异常检测方法。[1] 在假设数据集中大多数实例都是正常的前提下，无监督异常检测方法能通过寻找与其他数据最不匹配的实例来检测出未标记测试数据的异常。监督式异常检测方法需要一个已经被标记“正常”与“异常”的数据集，并涉及到训练分类器（与许多其他的统计分类问题的关键区别是异常检测的内在不均衡性）。半监督式异常检测方法根据一个给定的正常训练数据集创建一个表示正常行为的模型，然后检测由学习模型生成的测试实例的可能性。

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

28+阅读 · 2020年6月13日

【深度伪造综述论文】The Creation and Detection of Deepfakes: A Survey

专知会员服务

55+阅读 · 2020年4月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【O'Reilly AI Conference 2019】使用深度学习进行异常检测以测量大型数据集的质量（Anomaly detection using deep learning to measure the quality of large datasets），BlueWhale的联合创始人兼CTO Sridhar Alla

专知会员服务

28+阅读 · 2019年11月5日