The numerical reasoning in the financial domain -- performing quantitative analysis and summarizing the information from financial reports -- can greatly increase business efficiency and reduce costs of billions of dollars. Here, we propose a numerical reasoning question answering system to answer numerical reasoning questions among financial text and table data sources, consisting of a retriever module, a generator module, and an ensemble module. Specifically, in the retriever module, in addition to retrieving the whole row data, we innovatively design a cell retriever that retrieves the gold cells to avoid bringing unrelated and similar cells in the same row to the inputs of the generator module. In the generator module, we utilize multiple generators to produce programs, which are operation steps to answer the question. Finally, in the ensemble module, we integrate multiple programs to choose the best program as the output of our system. In the final private test set in FinQA Competition, our system obtains 69.79 execution accuracy.
翻译:财务领域的数字推理 -- -- 进行定量分析和总结财务报告中的信息 -- -- 可以大大提高业务效率,降低数十亿美元的成本。在这里,我们提议了一个数字推理解答系统,以回答财务文本和表格数据来源之间的数字推理问题,包括一个检索模块、一个发电机模块和一个混合模块。具体地说,在检索模块中,除了检索整个行的数据外,我们还创新地设计了一个细胞检索器,以检索金细胞,避免将同一行的不相干和相似的单元格带入发电机模块。在发电机模块中,我们使用多个生成器来生成程序,这是回答问题的操作步骤。最后,在组合模块中,我们整合了多个程序,选择最佳程序作为我们的系统输出。在FinQA竞争中的最后的私人测试组中,我们的系统获得了69.79执行精确度。