Deep convolutional autoencoders provide an effective tool for learning non-linear dimensionality reduction in an unsupervised way. Recently, they have been used for the task of anomaly detection in the visual domain. By optimising for the reconstruction error using anomaly-free examples, the common belief is that a trained network will have difficulties to reconstruct anomalous parts during the test phase. This is usually done by controlling the capacity of the network by either reducing the size of the bottleneck layer or enforcing sparsity constraints on its activations. However, neither of these techniques does explicitly penalise reconstruction of anomalous signals often resulting in a poor detection. We tackle this problem by adapting a self-supervised learning regime which allows to use discriminative information during training while regularising the model to focus on the data manifold by means of a modified reconstruction error resulting in an accurate detection. Unlike related approaches, the inference of the proposed method during training and prediction is very efficient processing the whole input image in one single step. Our experiments on the MVTec Anomaly Detection dataset demonstrate high recognition and localisation performance of the proposed method. On the texture-subset, in particular, our approach consistently outperforms a bunch of recent anomaly detection methods by a big margin.
翻译:深相自动电解码器提供了一种有效的工具,用于以不受监督的方式学习非线性维度的减少。 最近,它们被用于在视觉领域探测异常现象的任务。 通过利用无异常实例优化重建错误,人们普遍认为,在试验阶段,经过训练的网络将难以重建异常部分。通常通过降低瓶颈层的大小或对启动过程实施松散限制来控制网络的能力。然而,这些技术都没有明确惩罚经常导致检测不良的异常信号重建。我们通过调整自上而下的学习制度来解决这个问题,这种制度允许在培训期间使用歧视性信息,同时将模型正规化,通过修改后重建错误的方法集中关注数据,从而导致准确检测。与相关方法不同,在培训和预测期间拟议方法的推论非常高效地处理整个输入图像。我们在MV Tec 异常探测数据设置方面的实验显示,最近以高水平检测方法在高水平上的表现和本地化。