Anomaly detection is a challenging task and usually formulated as an unsupervised learning problem for the unexpectedness of anomalies. This paper proposes a simple yet powerful approach to this issue, which is implemented in the student-teacher framework for its advantages but substantially extends it in terms of both accuracy and efficiency. Given a strong model pre-trained on image classification as the teacher, we distill the knowledge into a single student network with the identical architecture to learn the distribution of anomaly-free images and this one-step transfer preserves the crucial clues as much as possible. Moreover, we integrate the multi-scale feature matching strategy into the framework, and this hierarchical feature alignment enables the student network to receive a mixture of multi-level knowledge from the feature pyramid under better supervision, thus allowing to detect anomalies of various sizes. The difference between feature pyramids generated by the two networks serves as a scoring function indicating the probability of anomaly occurring. Due to such operations, our approach achieves accurate and fast pixel-level anomaly detection. Very competitive results are delivered on three major benchmarks, significantly superior to the state of the art ones. In addition, it makes inferences at a very high speed (with 100 FPS for images of the size at 256x256), at least dozens of times faster than the latest counterparts.
翻译:异常的检测是一项艰巨的任务,通常被设计成对异常现象出乎意料的未受监督的学习问题。本文件提出一个简单而有力的方法,在师生框架内实施,以体现其优点,但从准确性和效率两方面大大扩展。鉴于在作为教师的图像分类方面训练有素的强大模型,我们将知识注入一个具有相同结构的单一学生网络,以了解无异常图像的分布,而这一一步骤的传输尽可能保存关键线索。此外,我们将多尺度特征匹配战略纳入框架,这种分级特征匹配使学生网络能够在更好的监督下获得来自特征金字塔的多层次知识的混合,从而能够发现不同大小的异常现象。这两个网络产生的特征金字塔之间的差别是一个分数函数,表明出现异常的可能性。由于这种操作,我们的方法实现了准确和快速的像素级异常检测。在三大基准上取得了非常有竞争力的结果,大大优于艺术品的状态。此外,在最高速的256级图像中,在最慢的25到最快的25个比例。