Recent research has demonstrated that BERT shows potential in a wide range of natural language processing tasks. It has been adopted as the encoder in many state-of-the-art automatic summarization systems, which achieve excellent performance. However, little work has so far been done for Vietnamese. In this paper, we show how BERT can be applied to extractive text summarization in Vietnamese. We introduce a novel comparison between different multilingual and monolingual BERT models. The experimental results indicate that monolingual models produce promising results compared with multilingual models and with previous text summarization models for Vietnamese.