Autoencoder, as an essential part of many anomaly detection methods, is lacking flexibility on normal data in complex datasets. U-Net is proved to be effective for this purpose but overfits on the training data if trained by just using reconstruction error similar to other AE-based frameworks. Puzzle-solving, as a pretext task of self-supervised learning (SSL) methods, has earlier proved its ability in learning semantically meaningful features. We show that training U-Nets based on this task is an effective remedy that prevents overfitting and facilitates learning beyond pixel-level features. Shortcut solutions, however, are a big challenge in SSL tasks, including jigsaw puzzles. We propose adversarial robust training as an effective automatic shortcut removal. We achieve competitive or superior results compared to the State of the Art (SOTA) anomaly detection methods on various toy and real-world datasets. Unlike many competitors, the proposed framework is stable, fast, data-efficient, and does not require unprincipled early stopping.
翻译:作为许多异常现象探测方法的一个基本部分,自动编码器作为许多异常现象探测方法的正常数据缺乏灵活性。 U-Net证明在这方面行之有效,但如果仅使用类似于其他基于AE的框架的重建错误进行培训,则培训数据就会过多。作为自我监督学习(SSL)方法的托辞,拼图解答早就证明了它学习语义上有意义的特征的能力。我们表明,基于这项任务的培训U-Net是一种有效的补救办法,防止超额安装并促进学习,超越像素级特性。然而,快捷式解决方案是SSL任务中的一个重大挑战,包括拼图拼图。我们建议,将对抗性强的培训作为一种有效的自动捷径清除方法。我们比艺术状态(SOTA)在各种玩具和现实世界数据集上取得竞争或优越的结果。与许多竞争者不同,拟议的框架是稳定、快速、数据效率高的,不需要未经解释的早期停止。