Machine Reading Comprehension with Unanswerable Questions is a difficult NLP task, challenged by the questions which can not be answered from passages. It is observed that subtle literal changes often make an answerable question unanswerable, however, most MRC models fail to recognize such changes. To address this problem, in this paper, we propose a span-based method of Contrastive Learning (spanCL) which explicitly contrast answerable questions with their answerable and unanswerable counterparts at the answer span level. With spanCL, MRC models are forced to perceive crucial semantic changes from slight literal differences. Experiments on SQuAD 2.0 dataset show that spanCL can improve baselines significantly, yielding 0.86-2.14 absolute EM improvements. Additional experiments also show that spanCL is an effective way to utilize generated questions.
翻译:机读理解无法解答的问题是一项困难的NLP任务,受到无法从段落中解答的问题的质疑。据观察,微妙的字面变化往往使一个无法解答的问题无法回答,然而,大多数MRC模型都不承认这样的变化。为了解决这个问题,我们在本文件中建议采用基于跨线的反向学习方法(SpanCL),该方法明确将可答问题与在回答范围内的可答和无法解答的对应方作对比。有了BluCL,MRC模型不得不从轻微的字面差异中看到关键的语义变化。 SQuAD 2. 0数据集实验显示,跨线CL可以大大改进基线,产生0.86-2.14的绝对EMM改进。其他实验还表明,BACL是利用所产生问题的有效方法。