阅读理解和回答问题混合网络 (Multi-granularity hierarchical attention fusion networks for reading comprehension and question answering)

This paper describes a novel hierarchical attention network for reading comprehension style question answering, which aims to answer questions for a given narrative paragraph. In the proposed method, attention and fusion are conducted horizontally and vertically across layers at different levels of granularity between question and paragraph. Specifically, it first encode the question and paragraph with fine-grained language embeddings, to better capture the respective representations at semantic level. Then it proposes a multi-granularity fusion approach to fully fuse information from both global and attended representations. Finally, it introduces a hierarchical attention network to focuses on the answer span progressively with multi-level softalignment. Extensive experiments on the large-scale SQuAD and TriviaQA datasets validate the effectiveness of the proposed method. At the time of writing the paper (Jan. 12th 2018), our model achieves the first position on the SQuAD leaderboard for both single and ensemble models. We also achieves state-of-the-art results on TriviaQA, AddSent and AddOne-Sent datasets.

翻译：本文描述了阅读理解风格解答新颖的分级关注网络,目的是回答特定叙述段落的问题。在拟议方法中,关注和融合在水平上和垂直跨层次的层次上,在问题和段落之间的颗粒度不同层次上进行。具体地说,它首先将问题和段落编码成细微语言嵌入,以更好地在语义层次上反映各自的表达方式。然后,它提出了一种多级融合方法,以充分融合来自全球和出席的表达式的信息。最后,它引入了一个分级关注网络,以多级软调整的方式逐步关注答案的跨度。关于大规模 SQuAD 和 TriviaQA 数据集的广泛实验证实了拟议方法的有效性。在撰写论文时(Jan. 12th 2018),我们的模型在SQuAD领导板上就单一模型和共同模型取得了第一个位置。我们还取得了TriviaQA、AddSent和Addione-Sent数据集的最新结果。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

学习具有层次标签的图像表示，Learning Representations For Images With Hierarchical Labels

专知会员服务

38+阅读 · 2020年4月6日

【芝加哥大学】GRAPH-BERT: Only Attention is Needed for Learning Graph Representations

专知会员服务

85+阅读 · 2020年1月15日

专知会员服务

41+阅读 · 2019年11月24日