异常探测,配有超配自动编码器组合组合 (Anomaly Detection With Partitioning Overfitting Autoencoder Ensembles)

In this paper, we propose POTATOES (Partitioning OverfiTting AuTOencoder EnSemble), a new method for unsupervised outlier detection (UOD). More precisely, given any autoencoder for UOD, this technique can be used to improve its accuracy while at the same time removing the burden of tuning its regularization. The idea is to not regularize at all, but to rather randomly partition the data into sufficiently many equally sized parts, overfit each part with its own autoencoder, and to use the maximum over all autoencoder reconstruction errors as the anomaly score. We apply our model to various realistic datasets and show that if the set of inliers is dense enough, our method indeed improves the UOD performance of a given autoencoder significantly. For reproducibility, the code is made available on github so the reader can recreate the results in this paper as well as apply the method to other autoencoders and datasets.

翻译：在本文中,我们建议使用“POTATOES ”, 这是一种不受监督外出检测的新方法(UOD ) 。更确切地说, 如果有UOD的自动编码器, 这种方法可以用来提高它的准确性, 同时消除调整其规范化的负担。想法是完全不规范数据, 而是随机地将数据分成足够多的同等大小的部件, 将每个部件都配上自己的自动编码器, 并使用所有自动编码器重建错误的最大值作为异常分。我们将我们的模型应用到各种现实数据集中, 并显示如果离子集密度足够大, 我们的方法确实可以显著地提高给定的自动编码的 UOD性能。为了复制, 代码可以在 github 上提供, 以便读者可以重新生成此文件中的结果, 并将该方法应用到其他自动编码和数据集中。

相关内容

自编码器

关注 140

自动编码器是一种人工神经网络，用于以无监督的方式学习有效的数据编码。自动编码器的目的是通过训练网络忽略信号“噪声”来学习一组数据的表示（编码），通常用于降维。与简化方面一起，学习了重构方面，在此，自动编码器尝试从简化编码中生成尽可能接近其原始输入的表示形式，从而得到其名称。基本模型存在几种变体，其目的是迫使学习的输入表示形式具有有用的属性。自动编码器可有效地解决许多应用问题，从面部识别到获取单词的语义。

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【AAAI2021】组合对抗攻击

专知会员服务

51+阅读 · 2021年2月17日