Semi-supervised learning has shown promise in allowing NLP models to generalize from small amounts of labeled data. Meanwhile, pretrained transformer models act as black-box correlation engines that are difficult to explain and sometimes behave unreliably. In this paper, we propose tackling both of these challenges via Automatic Rule Induction (ARI), a simple and general-purpose framework for the automatic discovery and integration of symbolic rules into pretrained transformer models. First, we extract weak symbolic rules from low-capacity machine learning models trained on small amounts of labeled data. Next, we use an attention mechanism to integrate these rules into high-capacity pretrained transformer models. Last, the rule-augmented system becomes part of a self-training framework to boost supervision signal on unlabeled data. These steps can be layered beneath a variety of existing weak supervision and semi-supervised NLP algorithms in order to improve performance and interpretability. Experiments across nine sequence classification and relation extraction tasks suggest that ARI can improve state-of-the-art methods with no manual effort and minimal computational overhead.
翻译:半监督的学习显示,允许NLP模型从少量贴标签数据中进行概括化。 与此同时,预先培训的变压器模型作为黑盒相关引擎,很难解释,有时行为不可靠。在本文中,我们建议通过自动规则上岗(ARI)来应对上述两项挑战,这是一个简单和通用的框架,用于自动发现象征性规则,并将其纳入预培训变压器模型。首先,我们从在少量贴标签数据方面受过培训的低容量机器学习模型中提取了微弱的象征性规则。接下来,我们使用关注机制将这些规则纳入高容量预培训变压器模型。最后,规则强化系统成为提升无标签数据监督信号的自我培训框架的一部分。这些步骤可以置于各种现有的薄弱监管和半受监督的NLP算法之下,以提高性能和可判读性。在九个序列分类和关联提取任务中进行的实验表明,ARI可以改进最先进的方法,而没有手工操作和最低计算管理。