Two crucial issues for text summarization to generate faithful summaries are to make use of knowledge beyond text and to make use of cross-sentence relations in text. Intuitive ways for the two issues are Knowledge Graph (KG) and Graph Neural Network (GNN) respectively. Entities are semantic units in text and in KG. This paper focuses on both issues by leveraging entities mentioned in text to connect GNN and KG for summarization. Firstly, entities are leveraged to construct a sentence-entity graph with weighted multi-type edges to model sentence relations, and a relational heterogeneous GNN for summarization is proposed to calculate node encodings. Secondly, entities are leveraged to link the graph to KG to collect knowledge. Thirdly, entities guide a two-step summarization framework defining a multi-task selector to select salient sentences and entities, and using an entity-focused abstractor to compress the sentences. GNN is connected with KG by constructing sentence-entity graphs where entity-entity edges are built based on KG, initializing entity embeddings on KG, and training entity embeddings using entity-entity edges. The relational heterogeneous GNN utilizes both edge weights and edge types in GNN to calculate graphs with weighted multi-type edges. Experiments show the proposed method outperforms extractive baselines including the HGNN-based HGNNSum and abstractive baselines including the entity-driven SENECA on CNN/DM, and outperforms most baselines on NYT50. Experiments on sub-datasets show the density of sentence-entity edges greatly influences the performance of the proposed method. The greater the density, the better the performance. Ablations show effectiveness of the method.
翻译:文本总和生成准确摘要的两个关键问题是,利用文本以外的知识,利用文本中的跨句子关系。两个问题的直观方法分别为《知识图表》(KGG)和《图表神经网络》(GNN)。实体在文本和KG中都是语义单位。本文侧重于这两个问题,办法是利用文本中提及的实体来连接 GNN 和 KG 以进行语义拼写。首先,利用实体来构建一个句子实体实体属性图,其中含有加权的多类型基线边距与模型句子关系,并提议一个用于计算节点编码的关连性 GNNNNNNN,其次,以计算节点码编码。第二,利用实体将图表链接链接到 KGG(KGG) 收集知识。第三,实体指导一个双步式的语系和缩略式缩略图框架,其中显示GNFR(包括GGG) 格式的更高等级的运行情况。GNNF(包括G) 和GMIL(G) 格式中的大多数实体使用G-raldealdalal) 格式的运行方法,其中显示G-ral-ral-deal-de 。