We present assertion based question answering (ABQA), an open domain question answering task that takes a question and a passage as inputs, and outputs a semi-structured assertion consisting of a subject, a predicate and a list of arguments. An assertion conveys more evidences than a short answer span in reading comprehension, and it is more concise than a tedious passage in passage-based QA. These advantages make ABQA more suitable for human-computer interaction scenarios such as voice-controlled speakers. Further progress towards improving ABQA requires richer supervised dataset and powerful models of text understanding. To remedy this, we introduce a new dataset called WebAssertions, which includes hand-annotated QA labels for 358,427 assertions in 55,960 web passages. To address ABQA, we develop both generative and extractive approaches. The backbone of our generative approach is sequence to sequence learning. In order to capture the structure of the output assertion, we introduce a hierarchical decoder that first generates the structure of the assertion and then generates the words of each field. The extractive approach is based on learning to rank. Features at different levels of granularity are designed to measure the semantic relevance between a question and an assertion. Experimental results show that our approaches have the ability to infer question-aware assertions from a passage. We further evaluate our approaches by incorporating the ABQA results as additional features in passage-based QA. Results on two datasets show that ABQA features significantly improve the accuracy on passage-based~QA.
翻译:我们提出了基于断言的答题(ABQA),这是一个开放域答题(ABQA),这是一个以问答为输入内容的开放域答题,一个半结构化的论断,由主题、前提和论点列表组成。一个论断在阅读理解中传达的证据多于一个简短的答题,比基于通道的答题简单得多。这些优点使得ABQA更适合像语音控制演讲者这样的人-计算机互动假设。改进ABQA的进一步进展需要更富有的监管数据集和强有力的文本理解模型。为了纠正这一点,我们引入了一个新的数据集,称为WebAssecients, 其中包括在55 960个网页段落中358 427个标语的手标注QA特性。为了对ABQA的简短回答,我们发展了既具有基因化和采掘方法的精度。我们的感官方法的骨干是顺序学习顺序。为了捕捉到产出主张的结构,我们引入了一种等级分解法,首先产生基于主张的结构,然后产生每个字段的词汇。采掘方法基于学习等级的等级学到等级特性特性的特性。A-Adalalalalalalalalal-axalalalal laxal lax 一种测量测量到一个不同层次测量的比。