In the open question answering (OBQA) task, how to select the relevant information from a large corpus is a crucial problem for reasoning and inference. Some datasets (e.g, HotpotQA) mainly focus on testing the model's reasoning ability at the sentence level. To overcome this challenge, many existing frameworks use a deep learning model to select relevant passages and then answer each question by matching a sentence in the corresponding passage. However, such frameworks require long inference time and fail to take advantage of the relationship between passages and sentences. In this work, we present a simple yet effective framework to address these problems by jointly ranking passages and selecting sentences. We propose consistency and similarity constraints to promote the correlation and interaction between passage ranking and sentence selection. In our experiments, we demonstrate that our framework can achieve competitive results and outperform the baseline by 28\% in terms of exact matching of relevant sentences on the HotpotQA dataset.
翻译:在开放式答题(OBQA)任务中,如何从大体中选择有关信息是推理和推论的一个关键问题。一些数据集(例如HotpotQA)主要侧重于在判决一级测试模型的推理能力。为了克服这一挑战,许多现有框架使用深层次学习模式来选择相关段落,然后在相应段落中对应句子来回答每个问题。然而,这种框架需要很长的推论时间,并且没有利用句子和句子之间的关系。在这项工作中,我们提出了一个简单而有效的框架,通过联合排列段落和选择句子来解决这些问题。我们提出一致性和相似性限制,以促进分级和句子选择之间的关联和互动。在我们的实验中,我们证明我们的框架可以取得竞争性的结果,在精确匹配HotpootQA数据集的相关句子方面超过基线28个。