检索具有多语言图表代表制学习的复杂表格 (Retrieving Complex Tables with Multi-Granular Graph Representation Learning)

The task of natural language table retrieval (NLTR) seeks to retrieve semantically relevant tables based on natural language queries. Existing learning systems for this task often treat tables as plain text based on the assumption that tables are structured as dataframes. However, tables can have complex layouts which indicate diverse dependencies between subtable structures, such as nested headers. As a result, queries may refer to different spans of relevant content that is distributed across these structures. Moreover, such systems fail to generalize to novel scenarios beyond those seen in the training set. Prior methods are still distant from a generalizable solution to the NLTR problem, as they fall short in handling complex table layouts or queries over multiple granularities. To address these issues, we propose Graph-based Table Retrieval (GTR), a generalizable NLTR framework with multi-granular graph representation learning. In our framework, a table is first converted into a tabular graph, with cell nodes, row nodes and column nodes to capture content at different granularities. Then the tabular graph is input to a Graph Transformer model that can capture both table cell content and the layout structures. To enhance the robustness and generalizability of the model, we further incorporate a self-supervised pre-training task based on graph-context matching. Experimental results on two benchmarks show that our method leads to significant improvements over the current state-of-the-art systems. Further experiments demonstrate promising performance of our method on cross-dataset generalization, and enhanced capability of handling complex tables and fulfilling diverse query intents. Code and data are available at https://github.com/FeiWang96/GTR.

翻译：自然语言表格检索任务(NLTR) 寻求检索基于自然语言查询的语义相关表格(NLTR) 。用于此任务的现有学习系统通常将表格视为简单的文本, 依据的假设是, 表格的结构结构是数据框架。但是, 表格的布局可能复杂, 表明子表格结构( 如嵌入页眉) 之间的不同依赖性。因此, 查询可能指不同范围的相关内容分布于这些结构中。此外, 这些系统无法概括到培训集中所看到的新情景。先前的方法仍然远离NLTR问题的一般性解决方案, 因为它们在处理复杂的表格布局或多颗粒质查询方面做得不够。为了解决这些问题, 我们提议基于图表的表格表格 Retrievval (GTR) 框架, 具有多色图图形表示的图形表达方式。在我们的框架中, 表格首先转换成一个图表, 有单元格节点、节点和列节点, 以捕捉捉到不同颗粒的复杂表格。然后, 表格将输入到一个图表变异模型模型模型模型,, 可以同时显示表格的当前系统内容的稳性格式, 和常规格式结构结构结构结构结构结构结构结构。,, 将显示我们基于表格的校正的校正的校正的校正的校正的校正。。