Machine Reading Comprehension has become one of the most advanced and popular research topics in the fields of Natural Language Processing in recent years. The classification of answerability questions is a relatively significant sub-task in machine reading comprehension; however, there haven't been many studies. Retro-Reader is one of the studies that has solved this problem effectively. However, the encoders of most traditional machine reading comprehension models in general and Retro-Reader, in particular, have not been able to exploit the contextual semantic information of the context completely. Inspired by SemBERT, we use semantic role labels from the SRL task to add semantics to pre-trained language models such as mBERT, XLM-R, PhoBERT. This experiment was conducted to compare the influence of semantics on the classification of answerability for the Vietnamese machine reading comprehension. Additionally, we hope this experiment will enhance the encoder for the Retro-Reader model's Sketchy Reading Module. The improved Retro-Reader model's encoder with semantics was first applied to the Vietnamese Machine Reading Comprehension task and obtained positive results.
翻译:近年来,可回答问题的分类是机器阅读理解方面一个相对重要的子任务;然而,还没有开展过许多研究。 Retro-Reader是有效解决这一问题的研究之一。然而,大多数传统机器阅读理解模型的编码者一般和Retro-Reader尤其未能完全利用上下文的语义信息。在SemBERT的启发下,我们使用SRL任务中的语义角色标签,在MBERT、XLM-R、PhoBERT等经过预先训练的语言模型中添加语义学。这项实验是为了比较越南机器阅读理解可回答性分类的语义学影响。此外,我们希望这一实验将加强Retro-Reader模型Sketchy阅读模块的语义化。改进的Retro-Reder模型与语义学的编码模型首先应用到越南机器阅读结果的积极任务。