ESAD: 端到端深半监督半监督异常探测 (ESAD: End-to-end Deep Semi-supervised Anomaly Detection)

This paper explores semi-supervised anomaly detection, a more practical setting for anomaly detection where a small set of labeled outlier samples are provided in addition to a large amount of unlabeled data for training. Rethinking the optimization target of anomaly detection, we propose a new objective function that measures the KL-divergence between normal and anomalous data, and prove that two factors: the mutual information between the data and latent representations, and the entropy of latent representations, constitute an integral objective function for anomaly detection. To resolve the contradiction in simultaneously optimizing the two factors, we propose a novel encoder-decoder-encoder structure, with the first encoder focusing on optimizing the mutual information and the second encoder focusing on optimizing the entropy. The two encoders are enforced to share similar encoding with a consistent constraint on their latent representations. Extensive experiments have revealed that the proposed method significantly outperforms several state-of-the-arts on multiple benchmark datasets, including medical diagnosis and several classic anomaly detection benchmarks.

翻译：本文探讨半监督的异常点检测,这是一个更实际的异常点检测环境,其中除了提供大量未贴标签的培训数据外,还提供了少量贴标签的异常点检测样本。重新思考异常点检测的最优化目标,我们建议一个新的客观功能,以测量正常数据和异常数据之间的 KL 差异度,并证明两个因素:数据和潜表层之间的相互信息,以及潜表层的酶,构成了异常点检测的一个整体客观功能。为了解决这一矛盾,同时优化这两个因素,我们建议建立一个新型的编码器结构,第一个编码器侧重于优化相互信息,第二个编码器侧重于优化对恒点的优化。两个编码器被强制使用来共享类似的编码,同时始终限制其潜在代表面。广泛的实验表明,拟议的方法大大超越了多个基准数据集的若干项状态,包括医学诊断和若干典型异常点检测基准。

相关内容

异常检测

关注 102

在数据挖掘中，异常检测（英语：anomaly detection）对不符合预期模式或数据集中其他项目的项目、事件或观测值的识别。通常异常项目会转变成银行欺诈、结构缺陷、医疗问题、文本错误等类型的问题。异常也被称为离群值、新奇、噪声、偏差和例外。特别是在检测滥用与网络入侵时，有趣性对象往往不是罕见对象，但却是超出预料的突发活动。这种模式不遵循通常统计定义中把异常点看作是罕见对象，于是许多异常检测方法（特别是无监督的方法）将对此类数据失效，除非进行了合适的聚集。相反，聚类分析算法可能可以检测出这些模式形成的微聚类。有三大类异常检测方法。[1] 在假设数据集中大多数实例都是正常的前提下，无监督异常检测方法能通过寻找与其他数据最不匹配的实例来检测出未标记测试数据的异常。监督式异常检测方法需要一个已经被标记“正常”与“异常”的数据集，并涉及到训练分类器（与许多其他的统计分类问题的关键区别是异常检测的内在不均衡性）。半监督式异常检测方法根据一个给定的正常训练数据集创建一个表示正常行为的模型，然后检测由学习模型生成的测试实例的可能性。