We focus on a type of linguistic formal reasoning where the goal is to reason over explicit knowledge in the form of natural language facts and rules (Clark et al., 2020). A recent work, named PRover (Saha et al., 2020), performs such reasoning by answering a question and also generating a proof graph that explains the answer. However, compositional reasoning is not always unique and there may be multiple ways of reaching the correct answer. Thus, in our work, we address a new and challenging problem of generating multiple proof graphs for reasoning over natural language rule-bases. Each proof provides a different rationale for the answer, thereby improving the interpretability of such reasoning systems. In order to jointly learn from all proof graphs and exploit the correlations between multiple proofs for a question, we pose this task as a set generation problem over structured output spaces where each proof is represented as a directed graph. We propose two variants of a proof-set generation model, multiPRover. Our first model, Multilabel-multiPRover, generates a set of proofs via multi-label classification and implicit conditioning between the proofs; while the second model, Iterative-multiPRover, generates proofs iteratively by explicitly conditioning on the previously generated proofs. Experiments on multiple synthetic, zero-shot, and human-paraphrased datasets reveal that both multiPRover models significantly outperform PRover on datasets containing multiple gold proofs. Iterative-multiPRover obtains state-of-the-art proof F1 in zero-shot scenarios where all examples have single correct proofs. It also generalizes better to questions requiring higher depths of reasoning where multiple proofs are more frequent. Our code and models are publicly available at https://github.com/swarnaHub/multiPRover
翻译:我们侧重于一种语言正式推理,其目标在于以自然语言事实和规则的形式解释清楚的知识(Clark等人,2020年)。最近的一项工作,名为PRover(Saha等人,2020年),通过回答一个问题和生成一个能解释答案的证明图表,来进行这种推理。然而,组成推理并不总是独特的,而且可能有许多方法达成正确的答案。因此,在我们的工作中,我们处理一个具有挑战性的新问题,即为自然语言规则基础的推理制作多种证据图表。每一项证据都为答案提供了不同的理由,从而改进了这种推理系统的可解释性。为了共同从所有证据图表中学习并利用一个问题多重证据之间的相互关系(Saha等人等人,2020年),我们将此任务作为结构化产出空间中的一个固定的生成问题,其中每种证据都以直接图表形式表示。我们提出了两个校正的生成模型,即多标签-多版本的模型,通过多标签分类和隐含的校正来生成所有证据;第二个模型,要求多个更高级的模型和多版本的滚动数据。