The anthology of spoken languages today is inundated with textual information, necessitating the development of automatic summarization models. In this manuscript, we propose an extractor-paraphraser based abstractive summarization system that exploits semantic overlap as opposed to its predecessors that focus more on syntactic information overlap. Our model outperforms the state-of-the-art baselines in terms of ROUGE, METEOR and word mover similarity (WMS), establishing the superiority of the proposed system via extensive ablation experiments. We have also challenged the summarization capabilities of the state of the art Pointer Generator Network (PGN), and through thorough experimentation, shown that PGN is more of a paraphraser, contrary to the prevailing notion of a summarizer; illustrating it's incapability to accumulate information across multiple sentences.
翻译:今天的口语文文文集全无文字信息,因此需要开发自动概括模型。在这个手稿中,我们提议了一个基于抽象句式的抽象总结系统,它利用语义重叠,而不是其前身,它更侧重于合成信息重叠。我们的模型在罗热、METEOR和字动词相似性方面优于最先进的基线,通过广泛的通缩实验确定了拟议系统的优越性。我们还对尖端点点发电机网络(PGN)和彻底实验的汇总能力提出了挑战,显示PGN更像是一个副口语,与流行的总结者概念相反;表明它无法在多个句子中积累信息。