In this paper, we illustrate how to fine-tune the entire Retrieval Augment Generation (RAG) architecture in an end-to-end manner. We highlighted the main engineering challenges that needed to be addressed to achieve this objective. We also compare how end-to-end RAG architecture outperforms the original RAG architecture for the task of question answering. We have open-sourced our implementation in the HuggingFace Transformers library.
翻译:在本文中,我们举例说明如何以端对端方式微调整个回收新一代增强型(RAG)架构。我们强调了实现这一目标需要应对的主要工程挑战。我们还比较了端到端的RAG架构如何优于最初的RAG架构,以完成回答问题的任务。我们以开放源码的形式在Hugging Face变换图书馆实施。