With the rapid advancement of retrieval-augmented vision-language models, multimodal medical retrieval-augmented generation (MMed-RAG) systems are increasingly adopted in clinical decision support. These systems enhance medical applications by performing cross-modal retrieval to integrate relevant visual and textual evidence for tasks, e.g., report generation and disease diagnosis. However, their complex architecture also introduces underexplored adversarial vulnerabilities, particularly via visual input perturbations. In this paper, we propose Medusa, a novel framework for crafting cross-modal transferable adversarial attacks on MMed-RAG systems under a black-box setting. Specifically, Medusa formulates the attack as a perturbation optimization problem, leveraging a multi-positive InfoNCE loss (MPIL) to align adversarial visual embeddings with medically plausible but malicious textual targets, thereby hijacking the retrieval process. To enhance transferability, we adopt a surrogate model ensemble and design a dual-loop optimization strategy augmented with invariant risk minimization (IRM). Extensive experiments on two real-world medical tasks, including medical report generation and disease diagnosis, demonstrate that Medusa achieves over 90% average attack success rate across various generation models and retrievers under appropriate parameter configuration, while remaining robust against four mainstream defenses, outperforming state-of-the-art baselines. Our results reveal critical vulnerabilities in the MMed-RAG systems and highlight the necessity of robustness benchmarking in safety-critical medical applications. The code and data are available at https://anonymous.4open.science/r/MMed-RAG-Attack-F05A.
翻译:随着检索增强视觉语言模型的快速发展,多模态医学检索增强生成(MMed-RAG)系统在临床决策支持中的应用日益广泛。这些系统通过执行跨模态检索,整合相关的视觉与文本证据以支持医疗报告生成和疾病诊断等任务,从而增强医学应用。然而,其复杂的架构也引入了尚未充分探究的对抗性脆弱性,尤其是通过视觉输入扰动。本文提出Medusa,一种在黑盒设置下针对MMed-RAG系统构建跨模态可迁移对抗攻击的新颖框架。具体而言,Medusa将攻击建模为扰动优化问题,利用多正样本InfoNCE损失(MPIL)将对抗性视觉嵌入与医学上合理但恶意的文本目标对齐,从而劫持检索过程。为增强可迁移性,我们采用代理模型集成,并设计了一种结合不变风险最小化(IRM)增强的双循环优化策略。在医疗报告生成和疾病诊断两项真实世界医学任务上的大量实验表明,在适当的参数配置下,Medusa在多种生成模型和检索器上实现了超过90%的平均攻击成功率,同时对四种主流防御方法保持鲁棒性,性能优于现有最先进的基线方法。我们的结果揭示了MMed-RAG系统中的关键脆弱性,并强调了在安全关键的医学应用中进行鲁棒性基准测试的必要性。代码与数据可在 https://anonymous.4open.science/r/MMed-RAG-Attack-F05A 获取。