Anomaly detection has recently gained increasing attention in the field of computer vision, likely due to its broad set of applications ranging from product fault detection on industrial production lines and impending event detection in video surveillance to finding lesions in medical scans. Regardless of the domain, anomaly detection is typically framed as a one-class classification task, where the learning is conducted on normal examples only. An entire family of successful anomaly detection methods is based on learning to reconstruct masked normal inputs (e.g. patches, future frames, etc.) and exerting the magnitude of the reconstruction error as an indicator for the abnormality level. Unlike other reconstruction-based methods, we present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level. The proposed self-supervised block is extremely flexible, enabling information masking at any layer of a neural network and being compatible with a wide range of neural architectures. In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, as well as a transformer for channel-wise attention. Furthermore, we show that our block is applicable to a wider variety of tasks, adding anomaly detection in medical images and thermal videos to the previously considered tasks based on RGB images and surveillance videos. We exhibit the generality and flexibility of SSMCTB by integrating it into multiple state-of-the-art neural models for anomaly detection, bringing forth empirical results that confirm considerable performance improvements on five benchmarks: MVTec AD, BRATS, Avenue, ShanghaiTech, and Thermal Rare Event. We release our code and data as open source at https://github.com/ristea/ssmctb.
翻译:最近,在计算机视觉领域,异常探测最近日益受到越来越多的关注,这很可能是由于计算机视觉领域应用范围广泛,从工业生产线的产品故障检测和视频监控中即将发生的事件检测到医学扫描中的损伤。不管领域如何,异常探测通常被设置为单级分类任务,仅以正常实例进行学习。一系列成功的异常探测方法都以学习重建隐藏的正常投入(如补丁、未来框架等)为基础,并将重建错误的规模作为异常程度的指标。与其他基于重建的方法不同,我们展示了一种全新的自我监督的多层混合变压变压器块(SSMCTB),在核心建筑一级包含基于重建的功能。拟议的自我监督部分非常灵活,在神经网络的任何一层都能够掩盖信息,并且与广泛的神经结构兼容。在这项工作中,我们以自我监督的正常变压变现基准(SS-PCAB)为基础,在3DMy-变压式变压式变压图像中展示了一个新的变压变压图像,在常规图像中将我们变现的变压的变现过程,在常规图像上将我们变现,将我们变现的变现的变压的变现,将一个图像转化为的变现的图像显示过程,将我们变现到一个图像的变现,将我们变现的变现的变现的变现到的变现的图像展示的变现的变现,将我们的图像的变现的图像显示的变现到的变现的变现过程的图像显示过程。