BERT, a pre-trained Transformer model, has achieved ground-breaking performance on multiple NLP tasks. In this paper, we describe BERTSUM, a simple variant of BERT, for extractive summarization. Our system is the state of the art on the CNN/Dailymail dataset, outperforming the previous best-performing system by 1.65 on ROUGE-L. The code to reproduce our results is available at https://github.com/nlpyang/BertSum.