Abstractive summarization at controllable lengths is a challenging task in natural language processing. It is even more challenging for domains where limited training data is available or scenarios in which the length of the summary is not known beforehand. At the same time, when it comes to trusting machine-generated summaries, explaining how a summary was constructed in human-understandable terms may be critical. We propose Multi-level Summarizer (MLS), a supervised method to construct abstractive summaries of a text document at controllable lengths. The key enabler of our method is an interpretable multi-headed attention mechanism that computes attention distribution over an input document using an array of timestep independent semantic kernels. Each kernel optimizes a human-interpretable syntactic or semantic property. Exhaustive experiments on two low-resource datasets in the English language show that MLS outperforms strong baselines by up to 14.70% in the METEOR score. Human evaluation of the summaries also suggests that they capture the key concepts of the document at various length-budgets.
翻译:在可控制长度下,抽象总结是自然语言处理中一项艰巨的任务。对于现有培训数据有限或摘要长度事先不为人知的假设的域,这甚至更加具有挑战性。同时,当涉及信任机器生成的摘要时,解释如何用人类无法理解的术语构建摘要可能是关键。我们提议多层次抽取器(MLS),这是在可控制长度内构建文本文档抽象摘要的受监督方法。我们方法的关键使能者是一个可解释的多头关注机制,它利用一系列时间步骤独立语义内核来计算输入文件的注意力分布。每个内核都优化了人类内部合成或语义属性。关于英语中两个低资源数据集的探索实验显示,MLS在METEOR分数中比强基线高出14.70%。对摘要的人类评估还表明,它们在不同长度列表中捕捉到文件的关键概念。