The essence of unsupervised anomaly detection is to learn the compact distribution of normal samples and detect outliers as anomalies in testing. Meanwhile, the anomalies in real-world are usually subtle and fine-grained in a high-resolution image especially for industrial applications. Towards this end, we propose a novel framework for unsupervised anomaly detection and localization. Our method aims at learning dense and compact distribution from normal images with a coarse-to-fine alignment process. The coarse alignment stage standardizes the pixel-wise position of objects in both image and feature levels. The fine alignment stage then densely maximizes the similarity of features among all corresponding locations in a batch. To facilitate the learning with only normal images, we propose a new pretext task called non-contrastive learning for the fine alignment stage. Non-contrastive learning extracts robust and discriminating normal image representations without making assumptions on abnormal samples, and it thus empowers our model to generalize to various anomalous scenarios. Extensive experiments on two typical industrial datasets of MVTec AD and BenTech AD demonstrate that our framework is effective in detecting various real-world defects and achieves a new state-of-the-art in industrial unsupervised anomaly detection.
翻译:未经监督的异常点探测的精髓是学习正常样本的紧凑分布,并检测出异常点作为测试中的异常点。与此同时,现实世界中的异常点通常是在高分辨率图像中微妙和细微的,特别是在工业应用方面。为此,我们提议了一个未经监督的异常点探测和本地化的新框架。我们的方法是学习以粗到软的校正进程从正常图像中密集和紧凑分布,以普通图像中粗略到软的校正进程;粗略的校正阶段使图像和特征水平中对象的像素定位标准化。随后的细微调整阶段将所有相应地点的相似性最大化。为了便利仅用普通图像进行学习,我们提出了一个新的托辞任务,即为细微校正阶段进行非动态学习。非动态学习提取了稳健而有区别的正常图像演示,而不对异常样本进行假设,从而使我们的模型能够对各种异常情景进行概括。关于MVTec AD和BenTech AD的两种典型的典型工业数据集进行广泛的实验,表明我们的框架在检测各种真实世界中能够有效地探测到各种异常的缺陷。