RGB-thermal salient object detection (SOD) aims to segment the common prominent regions of visible image and corresponding thermal infrared image that we call it RGBT SOD. Existing methods don't fully explore and exploit the potentials of complementarity of different modalities and multi-type cues of image contents, which play a vital role in achieving accurate results. In this paper, we propose a multi-interactive dual-decoder to mine and model the multi-type interactions for accurate RGBT SOD. In specific, we first encode two modalities into multi-level multi-modal feature representations. Then, we design a novel dual-decoder to conduct the interactions of multi-level features, two modalities and global contexts. With these interactions, our method works well in diversely challenging scenarios even in the presence of invalid modality. Finally, we carry out extensive experiments on public RGBT and RGBD SOD datasets, and the results show that the proposed method achieves the outstanding performance against state-of-the-art algorithms. The source code has been released at:https://github.com/lz118/Multi-interactive-Dual-decoder.
翻译:RGB-地热显要物体探测(SOD)旨在分割我们称之为RGBT SPOD的可见图像和相应的热红外图像的常见突出区域。现有方法并不完全探索和利用不同模式和图像内容多类型线索的互补潜力,这些方式和线索在取得准确结果方面发挥着至关重要的作用。在本文件中,我们提议为矿山提供一个多互动的双极分解器,并为准确的RGBT SOD建立多类型互动模型。具体地说,我们首先将两种模式编码为多层次的多模式特征演示。然后,我们设计了一种新型双分解器,进行多层次特征、两种模式和全球背景下的互动。随着这些互动,我们的方法在各种挑战性情景下运作良好,即使存在无效模式。最后,我们就公共的 RGBT 和 RGBD SOD 数据集进行了广泛的实验,结果显示,拟议的方法取得了与州-艺术算法的杰出性表现。源代码已经发布在:https://github.com/lz118/Mulde-interactival-interactal。