Recently, anomaly detection and localization in multimedia data have received significant attention among the machine learning community. In real-world applications such as medical diagnosis and industrial defect detection, anomalies only present in a fraction of the images. To extend the reconstruction-based anomaly detection architecture to the localized anomalies, we propose a self-supervised learning approach through random masking and then restoring, named Self-Supervised Masking (SSM) for unsupervised anomaly detection and localization. SSM not only enhances the training of the inpainting network but also leads to great improvement in the efficiency of mask prediction at inference. Through random masking, each image is augmented into a diverse set of training triplets, thus enabling the autoencoder to learn to reconstruct with masks of various sizes and shapes during training. To improve the efficiency and effectiveness of anomaly detection and localization at inference, we propose a novel progressive mask refinement approach that progressively uncovers the normal regions and finally locates the anomalous regions. The proposed SSM method outperforms several state-of-the-arts for both anomaly detection and anomaly localization, achieving 98.3% AUC on Retinal-OCT and 93.9% AUC on MVTec AD, respectively.
翻译:最近,多媒体数据中的异常检测和本地化在机器学习界受到极大关注。在医学诊断和工业缺陷检测等现实世界应用中,只有一小部分图像中存在异常现象。为了将重建的异常检测结构扩大到局部异常现象,我们建议通过随机遮罩和恢复,采用自我监督的学习方法,称为自我监督的遮罩(SSSM),用于不受监督的异常检测和本地化。SSSM不仅加强了对涂料网络的培训,还导致推断时蒙面预测效率的极大提高。通过随机遮罩,每个图像都扩大成一套不同的培训三重体,从而使自动编码器能够在培训期间学会用不同尺寸和形状的遮罩进行重建。为了提高异常检测和本地化的效率和效力,我们提出了一种新的渐进式遮罩改进方法,逐步发现正常区域,并最终找到异常区域。拟议的SSSSMU方法在异常检测和异常地方化方面优于数个状态,分别实现了98.3%的AMV-AST ADUCA-9和93 % ADUCA-AD-ADUCADAD-9,分别实现了98.3和ACT-ACT-ACT-ADUC)