在可控制长度下抽象总结的可解释的多负责人关注 (Interpretable Multi-Headed Attention for Abstractive Summarization at Controllable Lengths)

Abstractive summarization at controllable lengths is a challenging task in natural language processing. It is even more challenging for domains where limited training data is available or scenarios in which the length of the summary is not known beforehand. At the same time, when it comes to trusting machine-generated summaries, explaining how a summary was constructed in human-understandable terms may be critical. We propose Multi-level Summarizer (MLS), a supervised method to construct abstractive summaries of a text document at controllable lengths. The key enabler of our method is an interpretable multi-headed attention mechanism that computes attention distribution over an input document using an array of timestep independent semantic kernels. Each kernel optimizes a human-interpretable syntactic or semantic property. Exhaustive experiments on two low-resource datasets in the English language show that MLS outperforms strong baselines by up to 14.70% in the METEOR score. Human evaluation of the summaries also suggests that they capture the key concepts of the document at various length-budgets.

翻译：在可控制长度下,抽象总结是自然语言处理中一项艰巨的任务。对于现有培训数据有限或摘要长度事先不为人知的假设的域,这甚至更加具有挑战性。同时,当涉及信任机器生成的摘要时,解释如何用人类无法理解的术语构建摘要可能是关键。我们提议多层次抽取器(MLS),这是在可控制长度内构建文本文档抽象摘要的受监督方法。我们方法的关键使能者是一个可解释的多头关注机制,它利用一系列时间步骤独立语义内核来计算输入文件的注意力分布。每个内核都优化了人类内部合成或语义属性。关于英语中两个低资源数据集的探索实验显示,MLS在METEOR分数中比强基线高出14.70%。对摘要的人类评估还表明,它们在不同长度列表中捕捉到文件的关键概念。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

自然语言处理中的注意力机制，Attention in Natural Language Processing

专知会员服务

136+阅读 · 2020年5月30日

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

33+阅读 · 2020年4月24日