Despite the recent advancements in abstractive summarization systems leveraged from large-scale datasets and pre-trained language models, the factual correctness of the summary is still insufficient. One line of trials to mitigate this problem is to include a post-editing process that can detect and correct factual errors in the summary. In building such a post-editing system, it is strongly required that 1) the process has a high success rate and interpretability and 2) has a fast running time. Previous approaches focus on regeneration of the summary using the autoregressive models, which lack interpretability and require high computing resources. In this paper, we propose an efficient factual error correction system RFEC based on entities retrieval post-editing process. RFEC first retrieves the evidence sentences from the original document by comparing the sentences with the target summary. This approach greatly reduces the length of text for a system to analyze. Next, RFEC detects the entity-level errors in the summaries by considering the evidence sentences and substitutes the wrong entities with the accurate entities from the evidence sentences. Experimental results show that our proposed error correction system shows more competitive performance than baseline methods in correcting the factual errors with a much faster speed.
翻译:尽管从大型数据集和经过事先培训的语言模型中获取的抽象总结系统最近有所进展,但摘要的实际正确性仍然不足。缓解这一问题的一线试验是包括编辑后程序,该程序可以探测和纠正摘要中的事实错误。在建立这种编辑后系统时,强烈要求(1)该程序的成功率和可解释性很高,(2)具有快速运行的时间。以前的做法侧重于利用不易解释和需要高计算资源的自动递减模型恢复摘要。在本文件中,我们建议基于实体检索编辑后程序的高效事实错误纠正系统RFEC。RFEC首先通过将判决与目标摘要进行比较,从原始文件中检索证据句子。这种方法大大缩短了用于分析的系统文本长度。接着,RFEC通过考虑证据判决来检测摘要中的实体级错误,并用证据判决中准确的实体替代错误。实验结果表明,我们提议的错误纠正系统在纠正事实错误方面比基准方法更具有竞争力,以更快的速度纠正事实错误。