Industrial defect detection is commonly addressed with anomaly detection (AD) methods where no or only incomplete data of potentially occurring defects is available. This work discovers previously unknown problems of student-teacher approaches for AD and proposes a solution, where two neural networks are trained to produce the same output for the defect-free training examples. The core assumption of student-teacher networks is that the distance between the outputs of both networks is larger for anomalies since they are absent in training. However, previous methods suffer from the similarity of student and teacher architecture, such that the distance is undesirably small for anomalies. For this reason, we propose asymmetric student-teacher networks (AST). We train a normalizing flow for density estimation as a teacher and a conventional feed-forward network as a student to trigger large distances for anomalies: The bijectivity of the normalizing flow enforces a divergence of teacher outputs for anomalies compared to normal data. Outside the training distribution the student cannot imitate this divergence due to its fundamentally different architecture. Our AST network compensates for wrongly estimated likelihoods by a normalizing flow, which was alternatively used for anomaly detection in previous work. We show that our method produces state-of-the-art results on the two currently most relevant defect detection datasets MVTec AD and MVTec 3D-AD regarding image-level anomaly detection on RGB and 3D data.
翻译:研究发现以前不为人知的师生对AD的方法问题,并提出解决办法,即两个神经网络经过培训,为无缺陷培训实例产生同样的产出。 学生-教师网络的核心假设是,这两个网络产出之间的距离对于异常点来说,因为没有接受培训,因此异常点的教师和学生网络的核心假设是,这两个网络产出之间的距离对异常点而言较大。然而,以前的方法与学生和教师结构相似,因此,异常点的距离极小。为此,我们建议建立不对称的师生网络(AST)。为此,我们提议了不对称的学生-教师网络(AST)。我们培训一个正常化的流量,以便作为教师进行密度估计,而一个传统的前方供餐网络,作为学生进行训练,以便产生与无缺陷培训实例相同的产出。 学生-教师网络的核心假设是:正常流动的双向点使教师产出与正常数据存在差异。在培训分布之外,学生无法因根本不同的结构而模仿这种差异。我们的AST网络通过正常流来弥补了错误估计的可能性,而这种可能性在先前的工作中被用于异常点检测异常点。 我们展示了我们的方法在SOD 3号的检测数据上产生了状态。