In this paper, we introduce the Reinforced Mnemonic Reader for machine reading comprehension tasks, which enhances previous attentive readers in two aspects. First, a reattention mechanism is proposed to refine current attentions by directly accessing to past attentions that are temporally memorized in a multi-round alignment architecture, so as to avoid the problems of attention redundancy and attention deficiency. Second, a new optimization approach, called dynamic-critical reinforcement learning, is introduced to extend the standard supervised method. It always encourages to predict a more acceptable answer so as to address the convergence suppression problem occurred in traditional reinforcement learning algorithms. Extensive experiments on the Stanford Question Answering Dataset (SQuAD) show that our model achieves state-of-the-art results. Meanwhile, our model outperforms previous systems by over 6% in terms of both Exact Match and F1 metrics on two adversarial SQuAD datasets.
翻译:在本文中,我们引入了用于机器阅读理解任务的强化中音读器阅读器,这在两个方面加强了先前关注的读者。首先,建议了一种保留机制,通过直接接触过去在多轮校准结构中暂时记忆的注意力来改善当前关注,以避免关注冗余和注意力不足的问题。第二,引入了一种新的优化方法,称为动态关键强化学习,以扩展标准监督方法。它总是鼓励预测一种更可接受的答案,以便解决传统强化学习算法中出现的趋同抑制问题。关于斯坦福问题解答数据集的广泛实验显示,我们的模型取得了最新结果。同时,我们的模型在两个对抗性 SQuAD 数据集上的匹配和F1衡量标准比以往系统高出6%以上。