Enhancing Pre-trained Models with Text Structure Knowledge for Question Generation

Pre-trained language models have achieved great success on the question generation (QG) task and significantly outperform traditional sequence-to-sequence approaches. However, they treat the input passage as a flat sequence and are therefore unaware of its text structure. For the QG task, we model text structure as answer position and syntactic dependency, and propose answer localness modeling and syntactic mask attention to address this limitation. Specifically, we present localness modeling with a Gaussian bias to enable the model to focus on the context surrounding the answer, and propose a mask attention mechanism to make the syntactic structure of the input passage accessible during question generation. Experiments on the SQuAD dataset show that the two proposed modules improve performance over the strong pre-trained model ProphetNet, and that combining them achieves results very competitive with the state-of-the-art pre-trained model.
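To make the two mechanisms concrete, here is a minimal sketch in plain Python. The abstract does not give the exact formulation, so everything below is an assumption for illustration: the function names, the choice to measure distance from the answer span, the `sigma` parameter, and the rule that a token may attend only to itself, its dependency head, and its children. It is not the authors' implementation.

```python
import math

def softmax(scores):
    # Numerically stable softmax; at least one score must be finite.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def gaussian_answer_bias(seq_len, answer_start, answer_end, sigma):
    # Hypothetical localness bias: 0 inside the answer span and
    # -(d^2) / (2 * sigma^2) outside it, where d is the token distance
    # to the nearest end of the span (inclusive indices).
    bias = []
    for i in range(seq_len):
        if i < answer_start:
            d = answer_start - i
        elif i > answer_end:
            d = i - answer_end
        else:
            d = 0
        bias.append(-(d * d) / (2.0 * sigma * sigma))
    return bias

def syntactic_mask(heads):
    # heads[i] is the index of token i's dependency head, or -1 for the root.
    # Assumed rule: a token may attend to itself, its head, and its children.
    n = len(heads)
    mask = [[False] * n for _ in range(n)]
    for i, h in enumerate(heads):
        mask[i][i] = True
        if h >= 0:
            mask[i][h] = True
            mask[h][i] = True
    return mask

def attend(scores, bias=None, mask_row=None):
    # Add the bias to the raw attention scores, set masked positions
    # to -inf, then normalize with softmax.
    adjusted = []
    for j, s in enumerate(scores):
        if bias is not None:
            s = s + bias[j]
        if mask_row is not None and not mask_row[j]:
            s = float("-inf")
        adjusted.append(s)
    return softmax(adjusted)

# Answer localness: uniform raw scores, answer span at positions 2..3.
bias = gaussian_answer_bias(6, 2, 3, sigma=1.0)
local_weights = attend([0.0] * 6, bias=bias)

# Syntactic mask: a toy 4-token dependency tree with token 1 as the root.
mask = syntactic_mask([1, -1, 1, 2])
masked_weights = attend([0.0] * 4, mask_row=mask[0])
```

With uniform raw scores, `local_weights` peaks on the answer positions and decays with distance from them, while `masked_weights` puts all attention mass on token 0 and its head. In a real model the bias and mask would be applied per attention head to learned scores; how the paper combines the two modules is not stated in the abstract.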
