Multimodal recommender systems (MRS) improve recommendation performance by integrating complementary semantic information from multiple modalities. However, the assumption of complete multimodality rarely holds in practice due to missing images and incomplete descriptions, hindering model robustness and generalization. To address these challenges, we introduce a novel method called \textbf{I$^3$-MRec}, which uses \textbf{I}nvariant learning with \textbf{I}nformation bottleneck principle for \textbf{I}ncomplete \textbf{M}odality \textbf{Rec}ommendation. To achieve robust performance in missing modality scenarios, I$^3$-MRec enforces two pivotal properties: (i) cross-modal preference invariance, ensuring consistent user preference modeling across varying modality environments, and (ii) compact yet effective multimodal representation, as modality information becomes unreliable in such scenarios, reducing the dependence on modality-specific information is particularly important. By treating each modality as a distinct semantic environment, I$^3$-MRec employs invariant risk minimization (IRM) to learn preference-oriented representations. In parallel, a missing-aware fusion module is developed to explicitly simulate modality-missing scenarios. Built upon the Information Bottleneck (IB) principle, the module aims to preserve essential user preference signals across these scenarios while effectively compressing modality-specific information. Extensive experiments conducted on three real-world datasets demonstrate that I$^3$-MRec consistently outperforms existing state-of-the-art MRS methods across various modality-missing scenarios, highlighting its effectiveness and robustness in practical applications.
翻译:多模态推荐系统通过整合来自多个模态的互补语义信息来提升推荐性能。然而,由于图像缺失和描述不完整,实际应用中完整的多模态假设很少成立,这阻碍了模型的鲁棒性和泛化能力。为应对这些挑战,我们提出了一种名为 \textbf{I$^3$-MRec} 的新方法,该方法利用基于\textbf{I}信息瓶颈原理的\textbf{I}不变性学习来处理\textbf{I}不完整的\textbf{M}多模态\textbf{Rec}推荐问题。为了在模态缺失场景下实现鲁棒性能,I$^3$-MRec 强化了两个关键特性:(i)跨模态偏好不变性,确保在不同模态环境下用户偏好建模的一致性;(ii)紧凑而有效的多模态表示,由于此类场景中模态信息变得不可靠,减少对模态特定信息的依赖尤为重要。通过将每个模态视为一个独立的语义环境,I$^3$-MRec 采用不变风险最小化来学习面向偏好的表示。同时,开发了一个缺失感知融合模块,以显式模拟模态缺失场景。该模块基于信息瓶颈原理,旨在保留这些场景中关键的用户偏好信号,同时有效压缩模态特定信息。在三个真实数据集上进行的大量实验表明,I$^3$-MRec 在各种模态缺失场景中均持续优于现有的先进多模态推荐方法,凸显了其在实际应用中的有效性和鲁棒性。