Explainable question answering systems should produce not only accurate answers but also rationales that justify their reasoning and allow humans to check their work. But what sorts of rationales are useful and how can we train systems to produce them? We propose a new style of rationale for open-book question answering, called \emph{markup-and-mask}, which combines aspects of extractive and free-text explanations. In the markup phase, the passage is augmented with free-text markup that enables each sentence to stand on its own outside the discourse context. In the masking phase, a sub-span of the marked-up passage is selected. To train a system to produce markup-and-mask rationales without annotations, we leverage in-context learning. Specifically, we generate silver annotated data by sending a series of prompts to a frozen pretrained language model, which acts as a teacher. We then fine-tune a smaller student model by training on the subset of rationales that led to correct answers. The student is "honest" in the sense that it is a pipeline: the rationale acts as a bottleneck between the passage and the answer, while the "untrusted" teacher operates under no such constraints. Thus, we offer a new way to build trustworthy pipeline systems from a combination of end-task annotations and frozen pretrained language models.
翻译:解答问题解答系统不仅应产生准确的答案,而且还应产生解释其推理和允许人类检查其工作的理由。但是,选择了何种理由是有用的,我们如何培训系统来产生这些理由呢?我们提出了开放书籍问答的新型理论原理,称为\emph{margup-and-mask},它将采掘和自由文本解释的各方面结合起来。在评分阶段,通过自由文本标记增加一段,使每个句子都能在讨论范围之外站立。在遮盖阶段,选择了标记通道的子范围。为了训练一个系统,在没有说明的情况下产生标记和标记的理由,我们利用正文学习。具体地说,我们通过将一系列提示发送到冷冻的预先训练语言模型,作为教师。然后,我们通过对导致正确回答的一组理由进行微小学生模型的微调。学生是“诚实的”的,因为这是一个管道:原理原理模型在没有说明说明的情况下产生标记标记标记标记,我们用一个固定的标签,同时我们用一种固定的标签来设置一个固定的标签。