用于检索增强型大语言模型的属性和流利取舍 (Characterizing Attribution and Fluency Tradeoffs for Retrieval-Augmented Large Language Models)

Despite recent progress, it has been difficult to prevent semantic hallucinations in generative Large Language Models. One common solution to this is augmenting LLMs with a retrieval system and making sure that the generated output is attributable to the retrieved information. Given this new added constraint, it is plausible to expect that the overall quality of the output will be affected, for example, in terms of fluency. Can scaling language models help? Here we examine the relationship between fluency and attribution in LLMs prompted with retrieved evidence in knowledge-heavy dialog settings. Our experiments were implemented with a set of auto-metrics that are aligned with human preferences. They were used to evaluate a large set of generations, produced under varying parameters of LLMs and supplied context. We show that larger models tend to do much better in both fluency and attribution, and that (naively) using top-k retrieval versus top-1 retrieval improves attribution but hurts fluency. We next propose a recipe that could allow smaller models to both close the gap with larger models and preserve the benefits of top-k retrieval while avoiding its drawbacks.

翻译：尽管最近取得了进展,但很难防止在基因型大语言模型中出现语义上的幻觉。这方面的一个共同解决办法是用一个检索系统扩大LLMS,确保生成的产出可归因于检索的信息。鉴于这一新的额外限制,似乎可以预期产出的总体质量会受到影响,例如,流畅性方面。缩放语言模型能帮助吗?我们在这里研究LLMS中流利和归属之间的关系,这些流利和归属是在知识重对话环境中以检索的证据促进的。我们实验是在一套符合人类偏好的自动测量方法下进行的。它们被用来评价在各种LLMS参数下产生的大量代人,并提供了背景。我们表明,较大的模型往往在流利性和归属性两方面都做得更好,而且(通常地)使用顶级检索和顶级检索可以改进归属,但会伤害流利性。我们随后建议一种配方,允许较小的模型既用较大的模型来缩小差距,又保留顶级检索的好处,同时避免其背。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日