The RAG Paradox: A Black-Box Attack Exploiting Unintentional Vulnerabilities in Retrieval-Augmented Generation Systems

With the growing adoption of retrieval-augmented generation (RAG) systems, various attack methods have been proposed to degrade their performance. However, most existing approaches rely on unrealistic assumptions in which external attackers have access to internal components such as the retriever. To address this issue, we introduce a realistic black-box attack based on the RAG paradox, a structural vulnerability arising from the system's effort to enhance trust by revealing both the retrieved documents and their sources to users. This transparency enables attackers to observe which sources are used and how information is phrased, allowing them to craft poisoned documents that are more likely to be retrieved and upload them to the identified sources. Moreover, as RAG systems directly provide retrieved content to users, these documents must not only be retrievable but also appear natural and credible to maintain user confidence in the search results. Unlike prior work that focuses solely on improving document retrievability, our attack method explicitly considers both retrievability and user trust in the retrieved content. Both offline and online experiments demonstrate that our method significantly degrades system performance without internal access, while generating natural-looking poisoned documents.


