Human experts write summaries using different techniques, including rewriting a single sentence from the document or fusing multiple sentences into one summary sentence. These techniques are flexible and therefore difficult for any single method to imitate. To address this issue, we propose an adaptive model, GEMINI, that integrates a rewriter and a fuser to mimic the sentence rewriting and fusion techniques, respectively. For each summary sentence, GEMINI adaptively chooses either to rewrite a specific document sentence or to generate the sentence from scratch. Experiments demonstrate that our adaptive approach outperforms the pure abstractive and rewriting baselines on various benchmark datasets, especially when the dataset has a balanced distribution of styles. Interestingly, empirical results show that the human writing style of each summary sentence is consistently predictable given its context.