Most existing RGB-D salient object detection (SOD) models require large computational costs and memory consumption to accurately detect the salient objects. This limits the real-life applications of these RGB-D SOD models. To address this issue, a novel lightweight RGB-D SOD model is presented in this paper. Different from most existing models which usually employ the two-stream or single-stream structure, we propose to employ the middle-level fusion structure for designing lightweight RGB-D SOD model, due to the fact that the middle-level fusion structure can simultaneously exploit the modality-shared and modality-specific information as the two-stream structure and can significantly reduce the network's parameters as the single-stream structure. Based on this structure, a novel information-aware multi-modal feature fusion (IMFF) module is first designed to effectively capture the cross-modal complementary information. Then, a novel lightweight feature-level and decision-level feature fusion (LFDF) module is designed to aggregate the feature-level and the decision-level saliency information in different stages with less parameters. With IMFF and LFDF modules incorporated in the middle-level fusion structure, our proposed model has only 3.9M parameters and runs at 33 FPS. Furthermore, the experimental results on several benchmark datasets verify the effectiveness and superiority of the proposed method over some state-of-the-art methods.
翻译:现有大多数RGB-D显著物体探测模型(SOD)要求大量计算成本和内存消耗,以准确检测突出物体。这限制了这些RGB-D SOD模型的实际应用。为了解决这一问题,本文件介绍了一个新的轻量RGB-D SOD模型。与通常使用双流或单流结构的大多数现有模型不同,我们提议采用中级混合结构来设计轻量RGB-D SOD模型,因为中级混合结构可以同时利用作为双流结构的共享模式和特定模式的信息,并大大降低作为单一流结构的网络参数。基于这一结构,一个新的信息觉悟多模式组合模型首先设计以有效获取跨模式补充信息。随后,我们设计了一个新型轻量级特征级和决定级特征聚合模型(LFDF)模块,将不同阶段的特征级和决策级显著信息与较低参数结合起来。IMFFFF和MFR标准标准模型的中期测试模块在FSBS标准等级上运行了某些标准。