Identifying defects in the images of industrial products has been an important task to enhance quality control and reduce maintenance costs. In recent studies, industrial anomaly detection models were developed using pre-trained networks to learn nominal representations. To employ the relative positional information of each pixel, we present \textit{\textbf{N-pad}}, a novel method for anomaly detection and segmentation in a one-class learning setting that includes the neighborhood of the target pixel for model training and evaluation. Within the model architecture, pixel-wise nominal distributions are estimated by using the features of neighboring pixels with the target pixel to allow possible marginal misalignment. Moreover, the centroids from clusters of nominal features are identified as a representative nominal set. Accordingly, anomaly scores are inferred based on the Mahalanobis distances and Euclidean distances between the target pixel and the estimated distributions or the centroid set, respectively. Thus, we have achieved state-of-the-art performance in MVTec-AD with AUROC of 99.37 for anomaly detection and 98.75 for anomaly segmentation, reducing the error by 34\% compared to the next best performing model. Experiments in various settings further validate our model.
翻译:在最近的研究中,利用经过培训的网络,开发了工业异常检测模型,以学习名义表示;为了使用每个像素的相对位置信息,我们提供了一种在单级学习环境中发现和分解异常现象的新方法,其中包括用于示范培训和评估的目标像素附近地带;在模型结构中,通过使用与目标像素相邻的像素特征,利用目标像素相邻的像素特征来估计像素的名义分布;此外,从名义特征组群中确定有代表性的名义表示组别,根据马哈拉诺比的距离和Eucloidean的距离分别推断出异常检测和分解的新方法;因此,我们在MVTec-AD和AUROC的下一个模型(99.37)取得了最新水平的性能表现,在异常现象分解方面进一步进行了98.75的模型。此外,根据马哈拉诺比斯的距离和Eucloidean的距离,分别根据目标像素和估计分布或中子集,得出异常分数的分数。