Detection of object anomalies is crucial in industrial processes, but unsupervised anomaly detection and localization is particularly important due to the difficulty of obtaining a large number of defective samples and the unpredictable types of anomalies in real life. Among the existing unsupervised anomaly detection and localization methods, the NF-based scheme has achieved better results. However, the two subnets (complex functions) $s_{i}(u_{i})$ and $t_{i}(u_{i})$ in NF are usually multilayer perceptrons, which need to squeeze the input visual features from 2D flattening to 1D, destroying the spatial location relationship in the feature map and losing the spatial structure information. In order to retain and effectively extract spatial structure information, we design in this study a complex function model with alternating CBAM embedded in a stacked $3\times3$ full convolution, which is able to retain and effectively extract spatial structure information in the normalized flow model. Extensive experimental results on the MVTec AD dataset show that CAINNFlow achieves advanced levels of accuracy and inference efficiency based on CNN and Transformer backbone networks as feature extractors, and CAINNFlow achieves a pixel-level AUC of $98.64\%$ for anomaly detection in MVTec AD.
翻译:在工业过程中,对物体异常现象的检测至关重要,但是,由于难以获得大量有缺陷的样本和真实生活中不可预测的异常类型,在实际生活中难以获得大量有缺陷的样本和难以预测的异常类型,在现有的未经监督的异常现象检测和本地化方法中,基于NF的计划取得了更好的结果,然而,NF的两个子网(复合功能)$s ⁇ i}(u ⁇ i})(u ⁇ i})美元和$t ⁇ i}(u ⁇ })美元通常是多层透视器,需要将输入的视觉特征从平压到1D,破坏地貌图的空间定位关系并丢失空间结构信息。为了保留和有效提取空间结构信息,我们在本研究中设计了一个复杂的功能模型,将CBAM(复合功能功能)嵌入堆放的3美元时间3美元全变幻剂中,能够保留和有效地提取正常流动模型中的空间结构信息。MVTTTeec ADS的大规模实验结果表明,CANFLlow 达到高级的精确度和误差度水平,这是以CNIS和CMVCMVCSML的升级为标准的测测距8。