Advancement in large pretrained language models has significantly improved their performance for conditional language generation tasks including summarization albeit with hallucinations. To reduce hallucinations, conventional methods proposed improving beam search or using a fact checker as a postprocessing step. In this paper, we investigate the use of the Natural Language Inference (NLI) entailment metric to detect and prevent hallucinations in summary generation. We propose an NLI-assisted beam re-ranking mechanism by computing entailment probability scores between the input context and summarization model-generated beams during saliency-enhanced greedy decoding. Moreover, a diversity metric is introduced to compare its effectiveness against vanilla beam search. Our proposed algorithm significantly outperforms vanilla beam decoding on XSum and CNN/DM datasets.
翻译:为了减少幻觉,传统方法建议改进光束搜索或使用事实检查器作为后处理步骤。我们在本文件中调查使用自然语言推理(NLI)要求的参数来检测和防止在简易生成过程中产生幻觉。我们建议采用NLI协助的波束重新排位机制,计算输入环境与汇总模型生成的波束在显著增强的贪婪解码过程中的概率分数。此外,还采用了多样性指标来比较其相对于香草光束搜索的有效性。我们提议的算法大大超越了XSum和CNN/D数据集的香草光束解码。