Logical rules, both transferable and explainable, are widely used as weakly supervised signals for many downstream tasks such as named entity tagging. To reduce the human effort of writing rules, previous researchers adopt an iterative approach to automatically learn logical rules from several seed rules. However, obtaining more seed rules can only be accomplished by extra human annotation with heavy costs. Limited by the size and quality of the seed rules, the model performance of previous systems is bounded. In this paper, we develop a novel framework STREAM to distill task-specific logical rules from large pre-trained models. Specifically, we borrow recent prompt-based language models as the knowledge expert to yield initial seed rules, and based on the formed high-quality instance pool that acts as an intermediary role, we keep teaching the expert to fit our task and learning task-specific logical rules. Experiments on three public named entity tagging benchmarks demonstrate the effectiveness of our proposed framework. With several predefined prompt templates, our system has gained significant improvements over previous state-of-the-art methods.
翻译:逻辑规则,包括可转让和可解释的逻辑规则,被广泛用作许多下游任务,如名称实体标记等的薄弱监管信号。为减少人类写作规则的努力,前研究人员采取了一种迭代方法,自动从几个种子规则中学习逻辑规则。然而,获得更多的种子规则只能通过额外的人文说明来完成,费用高昂。由于种子规则的大小和质量有限,先前系统的模型性能相互交错。在本文件中,我们开发了一个新的框架STREAM,以从大型的预先培训模式中提取具体任务逻辑规则。具体地说,我们借用了最新的快速语言模型作为知识专家,以产生原始种子规则,并以构成的高质量实例集合为基础,作为中间角色,我们不断教导专家如何适应我们的任务和学习特定任务的逻辑规则。关于三个公共名称实体标记基准的实验表明我们拟议框架的有效性。我们系统有一些预先定义的快速模板,比以往的先进方法有了显著改进。