2D-based Industrial Anomaly Detection has been widely discussed, however, multimodal industrial anomaly detection based on 3D point clouds and RGB images still has many untouched fields. Existing multimodal industrial anomaly detection methods directly concatenate the multimodal features, which leads to a strong disturbance between features and harms the detection performance. In this paper, we propose Multi-3D-Memory (M3DM), a novel multimodal anomaly detection method with hybrid fusion scheme: firstly, we design an unsupervised feature fusion with patch-wise contrastive learning to encourage the interaction of different modal features; secondly, we use a decision layer fusion with multiple memory banks to avoid loss of information and additional novelty classifiers to make the final decision. We further propose a point feature alignment operation to better align the point cloud and RGB features. Extensive experiments show that our multimodal industrial anomaly detection model outperforms the state-of-the-art (SOTA) methods on both detection and segmentation precision on MVTec-3D AD dataset. Code is available at https://github.com/nomewang/M3DM.
翻译:2D 以工业异常探测为基础的基于2D 的工业异常探测已经广泛讨论,但是,基于3D点云和RGB图像的多式联运工业异常探测仍然有许多未触及的领域。现有的多式联运工业异常探测方法直接结合了多式联运特征,导致特征之间的强烈干扰和探测性能的损害。在本文件中,我们提议采用一种新型的多式联运异常探测方法M3D-Memory(M3DM),这是一种新型的多式联运异常探测方法,同时结合了混合式反差学习,以鼓励不同模式特征的互动;第二,我们使用与多个记忆库的决定层结合,以避免信息丢失,并增加新颖分类,以作出最后决定。我们进一步提议一个点特征调整操作,以更好地将点云和RGB特征相匹配。广泛的实验表明,我们的多式联运工业异常探测模型在MVTec-3DAD数据集的检测和分解精确度方面都超过了“SOTAD”(SOTA)的状态方法。代码见https://github.com/nomewang/MDDD。</s>