The ad-hoc retrieval task is to rank related documents given a query and a document collection. A series of deep learning based approaches have been proposed to solve such problem and gained lots of attention. However, we argue that they are inherently based on local word sequences, ignoring the subtle long-distance document-level word relationships. To solve the problem, we explicitly model the document-level word relationship through the graph structure, capturing the subtle information via graph neural networks. In addition, due to the complexity and scale of the document collections, it is considerable to explore the different grain-sized hierarchical matching signals at a more general level. Therefore, we propose a Graph-based Hierarchical Relevance Matching model (GHRM) for ad-hoc retrieval, by which we can capture the subtle and general hierarchical matching signals simultaneously. We validate the effects of GHRM over two representative ad-hoc retrieval benchmarks, the comprehensive experiments and results demonstrate its superiority over state-of-the-art methods.
翻译:特设检索的任务是根据查询和文件收藏对相关文件进行排序。已经提出了一系列基于深层次学习的办法来解决这一问题并引起人们的极大关注。然而,我们争辩说,这些方法本质上是基于本地的单词序列,忽视了微妙的长距离文档级单词关系。为了解决问题,我们通过图形结构明确模拟文件级单词关系,通过图形神经网络捕捉微妙的信息。此外,由于文件收藏的复杂性和规模,在更普遍的层次上探索不同的粒子尺寸等级匹配信号相当可观。因此,我们提议了一个基于图表的分级相关性匹配模型(GHRM)用于特别检索,通过该模型我们可以同时捕捉微妙和一般的分级匹配信号。我们验证GHRM对两个具有代表性的特设检索基准、全面实验和结果的影响,表明其优于最先进的方法。