We propose a novel Chain Guided Retriever-reader ({\tt CGR}) framework to model the reasoning chain for multi-hop Science Question Answering. Our framework is capable of performing explainable reasoning without the need of any corpus-specific annotations, such as the ground-truth reasoning chain, or human-annotated entity mentions. Specifically, we first generate reasoning chains from a semantic graph constructed by Abstract Meaning Representation of retrieved evidence facts. A \textit{Chain-aware loss}, concerning both local and global chain information, is also designed to enable the generated chains to serve as distant supervision signals for training the retriever, where reinforcement learning is also adopted to maximize the utility of the reasoning chains. Our framework allows the retriever to capture step-by-step clues of the entire reasoning process, which is not only shown to be effective on two challenging multi-hop Science QA tasks, namely OpenBookQA and ARC-Challenge, but also favors explainability.
翻译:我们提议了一个新颖的链路引向检索阅读器(Tht CGR})框架,以模拟多点科学问题解答的推理链。我们的框架能够执行可解释的推理,而不需要任何具体内容的说明,例如地面真相推理链或人文附加说明的实体。具体地说,我们首先从摘要表示所检索证据事实的抽象表示所构建的语义图中生成推理链。关于本地和全球链路信息的一个\ textit{Chain-aware loss} 也旨在让生成的链路成为远程监督信号,用于培训检索器,其中还采用强化学习,以最大限度地发挥推理链的效用。我们的框架允许检索器捕捉整个推理过程的逐步线索,这不仅在两项具有挑战性的多点科学QA任务(即 OpenBookQA 和 ARC-Challenge)上显示有效,而且有利于解释性。