关于多窗口问题回答和生成的调查 (A Survey on Multi-hop Question Answering and Generation)

The problem of Question Answering (QA) has attracted significant research interest for long. Its relevance to language understanding and knowledge retrieval tasks, along with the simple setting makes the task of QA crucial for strong AI systems. Recent success on simple QA tasks has shifted the focus to more complex settings. Among these, Multi-Hop QA (MHQA) is one of the most researched tasks over the recent years. The ability to answer multi-hop questions and perform multi step reasoning can significantly improve the utility of NLP systems. Consequently, the field has seen a sudden surge with high quality datasets, models and evaluation strategies. The notion of `multiple hops' is somewhat abstract which results in a large variety of tasks that require multi-hop reasoning. This implies that different datasets and models differ significantly which makes the field challenging to generalize and survey. This work aims to provide a general and formal definition of MHQA task, and organize and summarize existing MHQA frameworks. We also outline the best methods to create MHQA datasets. The paper provides a systematic and thorough introduction as well as the structuring of the existing attempts to this highly interesting, yet quite challenging task.

翻译：问题解答(QA)问题长期以来引起大量研究兴趣,它与语言理解和知识检索任务的相关性,加上简单设置,使得QA的任务对于强大的AI系统至关重要。最近,简单质量解答任务的成功将重点转移到了更复杂的环境。其中,多功能问答(MHQA)是近年来研究最多的任务之一。回答多窗口问题和进行多步骤推理的能力可以大大改善NLP系统的实用性。因此,外地出现突然激增,拥有高质量的数据集、模型和评价战略。“多跳”的概念有些抽象,导致需要多功能推理的大量任务。这意味着不同的数据集和模型差异很大,使得这个领域对概括和调查具有挑战性。这项工作的目的是为MHQA任务提供一个一般和正式的定义,并组织和总结现有的MHQA框架。我们还概述了创建MHQA数据集的最佳方法。“多跳”的概念有些抽象性,导致需要多功能推理的大量任务。这意味着,不同的数据集和模型差异很大,使得这个领域难以概括和调查。这项工作旨在为MHQA任务提供一个一般和正式的定义,并组织和总结现有的框架。我们还概述了创建MHQA数据集的最佳方法。该文件提供了一个非常有趣的尝试。