Some questions have multiple answers that are not equally correct, i.e. answers are different under different conditions. Conditions are used to distinguish answers as well as to provide additional information to support them. In this paper, we study a more challenging task where answers are constrained by a list of conditions that logically interact, which requires performing logical reasoning over the conditions to determine the correctness of the answers. Even more challenging, we only provide evidences for a subset of the conditions, so some questions may not have deterministic answers. In such cases, models are asked to find probable answers and identify conditions that need to be satisfied to make the answers correct. We propose a new model, TReasoner, for this challenging reasoning task. TReasoner consists of an entailment module, a reasoning module, and a generation module (if the answers are free-form text spans). TReasoner achieves state-of-the-art performance on two benchmark conditional QA datasets, outperforming the previous state-of-the-art by 3-10 points.
翻译:一些问题的答案不尽相同, 也就是说, 答案在不同条件下是不同的。 使用条件来区分答案并提供补充信息支持答案。 在本文中, 我们研究一项更具挑战性的任务, 答案受一系列逻辑互动条件的限制, 这要求对确定答案的正确性的条件进行逻辑推理。 更具有挑战性的是, 我们只为一组条件提供证据, 所以有些问题可能没有决定性的答案。 在这种情况下, 要求模型寻找可能的答案, 并确定需要满足的条件来纠正答案。 我们为这项具有挑战性的推理任务提出了一个新的模型, 即 TReasoner 。 TReasoner 包含一个包含要求模块、 推理模块和一代模块( 如果答案是自由格式的文本宽幅 ) 。 TReasoner 在两个有条件的 QA数据集基准上取得了最先进的表现, 3- 10 点比先前的状态表现快。