Policy Compliance Detection (PCD) is a task we encounter when reasoning over texts, e.g. legal frameworks. Previous work to address PCD relies heavily on modeling the task as a special case of Recognizing Textual Entailment. Entailment is applicable to the problem of PCD, however viewing the policy as a single proposition, as opposed to multiple interlinked propositions, yields poor performance and lacks explainability. To address this challenge, more recent proposals for PCD have argued for decomposing policies into expression trees consisting of questions connected with logic operators. Question answering is used to obtain answers to these questions with respect to a scenario. Finally, the expression tree is evaluated in order to arrive at an overall solution. However, this work assumes expression trees are provided by experts, thus limiting its applicability to new policies. In this work, we learn how to infer expression trees automatically from policy texts. We ensure the validity of the inferred trees by introducing constrained decoding using a finite state automaton to ensure the generation of valid trees. We determine through automatic evaluation that 63% of the expression trees generated by our constrained generation model are logically equivalent to gold trees. Human evaluation shows that 88% of trees generated by our model are correct.
翻译:在对文本(例如法律框架)进行推理时,我们遇到一项任务,即遵守政策探测(PCD) 。以前处理PCD的工作主要依赖将这项任务建模作为承认文字成份的特例。细节适用于PCD问题,但将政策视为单一的主张,而不是多个相互关联的提议,结果不佳,解释不力。为了应对这一挑战,最近关于PCD的建议主张将政策分解成表达树,包括与逻辑操作员有关的问题。问题解答用于获得对这些问题的答案。最后,对表达树进行评价,以达成一个整体解决办法。然而,这项工作假定表达树由专家提供,从而限制其适用于新政策。在这项工作中,我们学会如何从政策案文中自动推断树木的表达方式。我们通过采用有限的状态自动评估,确保推断的树木的有效性,通过使用有限的状态自动测量,确保生成有效的树木。我们受限制的生成模式所产生的表达树的63%在逻辑上等同于金树。人类评估显示,我们生成的树的88%正确性。