Numerical reasoning over hybrid data containing tables and long texts has recently received research attention from the AI community. To generate an executable reasoning program consisting of math and table operations to answer a question, state-of-the-art methods use a retriever-generator pipeline. However, their retrieval results are static, while different generation steps may rely on different sentences. To attend to the retrieved information that is relevant to each generation step, in this paper, we propose DyRRen, an extended retriever-reranker-generator framework where each generation step is enhanced by a dynamic reranking of retrieved sentences. It outperforms existing baselines on the FinQA dataset.
翻译:对包含表格和长篇文字的混合数据进行数字推理最近引起了AI界的研究关注。为了产生一个由数学和表格操作组成的可执行推理程序以解答问题,最先进的方法使用检索器生成管道。但是,它们的检索结果是静态的,而不同的一代步骤可能依赖不同的句子。为了处理与每一代步骤相关的检索信息,我们在本文件中建议DyRRen,一个扩展的检索器-再排序器-生成器框架,通过动态的检索句子重新排行来增强每一代步骤。它比FinQA数据集的现有基线要强。