Definitions are essential for term understanding. Recently, there is an increasing interest in extracting and generating definitions of terms automatically. However, existing approaches for this task are either extractive or abstractive - definitions are either extracted from a corpus or generated by a language generation model. In this paper, we propose to combine extraction and generation for definition modeling: first extract self- and correlative definitional information of target terms from the Web and then generate the final definitions by incorporating the extracted definitional information. Experiments demonstrate our framework can generate high-quality definitions for technical terms and outperform state-of-the-art models for definition modeling significantly.
翻译:定义对理解术语至关重要。最近,人们越来越有兴趣自动提取和生成术语定义,然而,这一任务的现有方法要么是抽取性的,要么是抽象的,要么是从文体中提取定义,要么是由语言生成模式生成的定义。在本文件中,我们提议将提取和生成结合起来,用于定义建模:首先从网上提取目标术语的自我和相关定义信息,然后通过纳入提取的定义信息产生最终定义。实验表明,我们的框架可以产生技术术语的高质量定义,并显著地生成出超先进的定义建模模型。