Neural models with an encoder-decoder framework provide a feasible solution to Question Generation (QG). However, after analyzing the model vocabulary, we find that in current models (both RNN-based and pre-training based) more than 23\% of the vocabulary consists of inflected forms. As a result, the encoder generates separate embeddings for these inflected forms, wasting training data and parameters. Even worse, during decoding these models are vulnerable to irrelevant noise and suffer from high computational costs. In this paper, we propose an approach to enhance the performance of QG by fusing word transformation. Firstly, we identify the inflected forms of words in the encoder input and replace them with their root words, allowing the encoder to concentrate on the recurring root words. Secondly, we recast QG as a combination of the following actions in the encoder-decoder framework: generating a question word, copying a word from the source sequence, or generating a word transformation type. Such an extension greatly reduces the size of the decoder's prediction vocabulary as well as the noise. We apply our approach to a typical RNN-based model and to \textsc{UniLM} to obtain improved versions. We conduct extensive experiments on the SQuAD and MS MARCO datasets. The experimental results show that the improved versions significantly outperform the corresponding baselines in terms of BLEU, ROUGE-L, and METEOR, as well as time cost.
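To make the first step concrete, the sketch below illustrates one possible way to normalize the encoder input: inflected forms are mapped to root words and the applied transformation is recorded so that a decoder action could later re-inflect a copied word. This is a minimal illustration, not the authors' implementation; it assumes an off-the-shelf lemmatizer (NLTK's \texttt{WordNetLemmatizer}) stands in for the paper's word-transformation identification step, and the function name \texttt{normalize\_tokens} is hypothetical.

\begin{verbatim}
# Illustrative sketch (not the paper's code): replace inflected forms in the
# encoder input with their root words and record the transformation, so a
# decoder could emit "copy root word + transformation type" instead of
# predicting every inflected surface form directly.
# Requires: nltk.download('wordnet') once before use.
from nltk.stem import WordNetLemmatizer

lemmatizer = WordNetLemmatizer()

def normalize_tokens(tokens, pos_tags):
    """Map each token to its root word plus a coarse transformation record.

    pos_tags are assumed to be WordNet-style tags ('n', 'v', 'a', 'r').
    Returns (roots, transforms); transforms[i] is None when the token
    was already a root word.
    """
    roots, transforms = [], []
    for tok, pos in zip(tokens, pos_tags):
        root = lemmatizer.lemmatize(tok.lower(), pos=pos)
        if root == tok.lower():
            transforms.append(None)          # already a root word
        else:
            transforms.append((root, tok))   # e.g. ('play', 'played')
        roots.append(root)
    return roots, transforms

# Example: 'played' and 'plays' both map to the shared root 'play',
# so the encoder sees one repeated root instead of two surface forms.
print(normalize_tokens(["She", "played", "and", "plays"],
                       ["n", "v", "n", "v"]))
\end{verbatim}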