Anomaly detection is commonly pursued as a one-class classification problem, where models can only learn from normal training samples, while being evaluated on both normal and abnormal test samples. Among the successful approaches for anomaly detection, a distinguished category of methods relies on predicting masked information (e.g. patches, future frames, etc.) and leveraging the reconstruction error with respect to the masked information as an abnormality score. Different from related methods, we propose to integrate the reconstruction-based functionality into a novel self-supervised predictive architectural building block. The proposed self-supervised block is generic and can easily be incorporated into various state-of-the-art anomaly detection methods. Our block starts with a convolutional layer with dilated filters, where the center area of the receptive field is masked. The resulting activation maps are passed through a channel attention module. Our block is equipped with a loss that minimizes the reconstruction error with respect to the masked area in the receptive field. We demonstrate the generality of our block by integrating it into several state-of-the-art frameworks for anomaly detection on image and video, providing empirical evidence that shows considerable performance improvements on MVTec AD, Avenue, and ShanghaiTech.
翻译:异常探测通常作为一种单级分类问题进行,模型只能从正常培训样本中学习,同时在正常和异常测试样本中进行评估。在异常检测的成功方法中,有一类不同的方法依赖于预测蒙面信息(如补丁、未来框架等),并利用蒙面信息的重建错误作为异常分数。与相关方法不同,我们提议将重建功能纳入一个新的自我监督的预测建筑构件。拟议的自我监督构件是通用的,可以很容易地纳入各种最新异常检测方法。我们的区块从带有扩张过滤器的革命层开始,即接受场的中心区域被遮掩。由此产生的启动地图通过一个频道注意模块传递。我们的区块装备的损失是尽量减少与接受场遮面区域有关的重建错误。我们通过将它纳入图像和视频异常检测的若干最先进的框架,提供实验性证据,显示上海大道和上海大道的显著性能改进。