This article proposes novel rules for false discovery rate control (FDRC) geared towards online anomaly detection in time series. Online FDRC rules allow to control the properties of a sequence of statistical tests. In the context of anomaly detection, the null hypothesis is that an observation is normal and the alternative is that it is anomalous. FDRC rules allow users to target a lower bound on precision in unsupervised settings. The methods proposed in this article overcome short-comings of previous FDRC rules in the context of anomaly detection, in particular ensuring that power remains high even when the alternative is exceedingly rare (typical in anomaly detection) and the test statistics are serially dependent (typical in time series). We show the soundness of these rules in both theory and experiments.
翻译:本条提出了针对时间序列中在线异常点检测的虚假发现率控制(FDRC)新新规则。在线FDRC规则允许控制统计测试序列的特性。在异常点检测方面,无效假设是观察是正常的,替代是不正常的。FDRC规则允许用户在未受监督的环境中以较低程度的精确度为目标。本条建议的方法克服了FDRC规则在异常点检测方面的缺陷,特别是确保即使在替代方法极为罕见(异常点检测通常)和测试统计数据具有序列依赖性(时间序列通常)的情况下,权力仍然很高。我们在理论和实验中都表明了这些规则的正确性。