Despite recent improvements in abstractive summarization, most current approaches generate summaries that are not factually consistent with the source document, severely restricting their trust and usage in real-world applications. Recent works have shown promising improvements in factuality error identification using text or dependency arc entailments; however, they do not consider the entire semantic graph simultaneously. To this end, we propose FactGraph, a method that decomposes the document and the summary into structured meaning representations (MR), which are more suitable for factuality evaluation. MRs describe core semantic concepts and their relations, aggregating the main content in both document and summary in a canonical form, and reducing data sparsity. FactGraph encodes such graphs using a graph encoder augmented with structure-aware adapters to capture interactions among the concepts based on the graph connectivity, along with text representations using an adapter-based text encoder. Experiments on different benchmarks for evaluating factuality show that FactGraph outperforms previous approaches by up to 15%. Furthermore, FactGraph improves performance on identifying content verifiability errors and better captures subsentence-level factual inconsistencies.
翻译:尽管最近在抽象概括方面有所改进,但大多数现行方法都产生与原始文件不符的概要,严重限制了对真实世界应用的信任和使用。最近的工作显示,在使用文字或依赖弧隐含物进行事实质量错误识别方面大有改进;然而,它们并没有同时考虑整个语义图。为此,我们建议采用FactGraph,这是将文件和摘要分解成结构化含义表示法的一种方法,更适合进行事实质量评估。MR描述核心语义概念及其关系,将文件和摘要的主要内容汇集在一种罐头形式中,并减少数据偏斜性。FactGraph用图表编码这种图表,用结构敏化的调整器加以扩充,以捕捉基于图形连通性的概念之间的相互作用,同时使用基于适应器的文字表示法。关于评估事实质量的不同基准的实验表明,FactGraph比以前的做法高出15%。此外,FactGraph改进了识别内容可核实性错误和更好地捕捉取子层面事实不一致性方面的绩效。