Question answering (QA) models for reading comprehension tend to learn shortcut solutions rather than the solutions intended by QA datasets. QA models that have learned shortcut solutions can achieve human-level performance in shortcut examples where shortcuts are valid, but these same behaviors degrade generalization potential on anti-shortcut examples where shortcuts are invalid. Various methods have been proposed to mitigate this problem, but they do not fully take the characteristics of shortcuts themselves into account. We assume that the learnability of shortcuts, i.e., how easy it is to learn a shortcut, is useful to mitigate the problem. Thus, we first examine the learnability of the representative shortcuts on extractive and multiple-choice QA datasets. Behavioral tests using biased training sets reveal that shortcuts that exploit answer positions and word-label correlations are preferentially learned for extractive and multiple-choice QA, respectively. We find that the more learnable a shortcut is, the flatter and deeper the loss landscape is around the shortcut solution in the parameter space. We also find that the availability of the preferred shortcuts tends to make the task easier to perform from an information-theoretic viewpoint. Lastly, we experimentally show that the learnability of shortcuts can be utilized to construct an effective QA training set; the more learnable a shortcut is, the smaller the proportion of anti-shortcut examples required to achieve comparable performance on shortcut and anti-shortcut examples. We claim that the learnability of shortcuts should be considered when designing mitigation methods.
翻译:读取解答( QA) 模式, 读取解答( QA) 模式, 倾向于学习捷径解决方案, 而不是QA 数据集 。 学习捷径解决方案 的 QA 模式, 可以在捷径有效的捷径示例中实现人类层面的性能, 但同样的行为会降低反shortcut( QA ) 实例的概括化潜力。 已经提出了各种方法来缓解这一问题, 但是它们并没有充分考虑捷径本身的特性。 我们认为, 捷径的可学性, 也就是学习捷径有多容易, 学习捷径有多容易, 从而缓解问题。 因此, 我们首先研究在采掘和多选取的 QA 数据集中代表捷径的可学性。 使用偏差的培训组合显示, 利用答案位置和字标签关联的捷径的捷径是优的。 我们发现, 更能学习捷径的捷径更小的捷径和损失环境应该围绕反捷径解决方案进行思考。 我们还发现, 首选的捷径的可用性是更容易从我们所学的捷径的捷径 。,,, 学习的可更容易在最后学习的捷径上更容易地学习到我们所学的 。