The goal of anomaly detection is to identify anomalous samples from normal ones. In this paper, a small number of anomalies are assumed to be available at the training stage, but they are assumed to be collected only from several anomaly types, leaving the majority of anomaly types not represented in the collected anomaly dataset at all. To effectively leverage this kind of incomplete anomalous knowledge represented by the collected anomalies, we propose to learn a probability distribution that can not only model the normal samples, but also guarantee to assign low density values for the collected anomalies. To this end, an anomaly-aware generative adversarial network (GAN) is developed, which, in addition to modeling the normal samples as most GANs do, is able to explicitly avoid assigning probabilities for collected anomalous samples. Moreover, to facilitate the computation of anomaly detection criteria like reconstruction error, the proposed anomaly-aware GAN is designed to be bidirectional, attaching an encoder for the generator. Extensive experimental results demonstrate that our proposed method is able to effectively make use of the incomplete anomalous information, leading to significant performance gains compared to existing methods.
翻译:异常点检测的目的是从正常的异常点中找出异常点样本。本文假定在培训阶段可以找到少量异常点,但假定仅从几种异常点类型中收集,使收集的异常点数据集中没有任何代表的多数异常点类型得以收集。为了有效地利用所收集的异常点代表的这种不完整异常点知识,我们建议了解一种概率分布,这种分布不仅能够模拟正常的样本,而且保证为所收集的异常点定出低密度值。为此,我们开发了一个异常点觉察识的基因对抗网络(GAN),除了像大多数GANs那样对正常的样本进行模拟外,还能够明确避免为所收集的异常点样本分配概率。此外,为了便利对异常点检测标准的计算,例如重建错误,拟议的异常点GAN是双向的,为发电机附加一个编码器。广泛的实验结果表明,我们所提议的方法能够有效地利用不完整的异常点信息,从而与现有方法相比,取得显著的业绩收益。