从大型预培训模式中提炼特定任务逻辑规则 (Distilling Task-specific Logical Rules from Large Pre-trained Models)

Logical rules, both transferable and explainable, are widely used as weakly supervised signals for many downstream tasks such as named entity tagging. To reduce the human effort of writing rules, previous researchers adopt an iterative approach to automatically learn logical rules from several seed rules. However, obtaining more seed rules can only be accomplished by extra human annotation with heavy costs. Limited by the size and quality of the seed rules, the model performance of previous systems is bounded. In this paper, we develop a novel framework STREAM to distill task-specific logical rules from large pre-trained models. Specifically, we borrow recent prompt-based language models as the knowledge expert to yield initial seed rules, and based on the formed high-quality instance pool that acts as an intermediary role, we keep teaching the expert to fit our task and learning task-specific logical rules. Experiments on three public named entity tagging benchmarks demonstrate the effectiveness of our proposed framework. With several predefined prompt templates, our system has gained significant improvements over previous state-of-the-art methods.

翻译：逻辑规则,包括可转让和可解释的逻辑规则,被广泛用作许多下游任务,如名称实体标记等的薄弱监管信号。为减少人类写作规则的努力,前研究人员采取了一种迭代方法,自动从几个种子规则中学习逻辑规则。然而,获得更多的种子规则只能通过额外的人文说明来完成,费用高昂。由于种子规则的大小和质量有限,先前系统的模型性能相互交错。在本文件中,我们开发了一个新的框架STREAM,以从大型的预先培训模式中提取具体任务逻辑规则。具体地说,我们借用了最新的快速语言模型作为知识专家,以产生原始种子规则,并以构成的高质量实例集合为基础,作为中间角色,我们不断教导专家如何适应我们的任务和学习特定任务的逻辑规则。关于三个公共名称实体标记基准的实验表明我们拟议框架的有效性。我们系统有一些预先定义的快速模板,比以往的先进方法有了显著改进。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/