Geometry problem solving has attracted much attention in the NLP community recently. The task is challenging as it requires abstract problem understanding and symbolic reasoning with axiomatic knowledge. However, current datasets are either small in scale or not publicly available. Thus, we construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language. We further propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem Solver (Inter-GPS). Inter-GPS first parses the problem text and diagram into formal language automatically via rule-based text parsing and neural object detecting, respectively. Unlike implicit learning in existing methods, Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step. Also, a theorem predictor is designed to infer the theorem application sequence fed to the symbolic solver for the more efficient and reasonable searching path. Extensive experiments on the Geometry3K and GEOS datasets demonstrate that Inter-GPS achieves significant improvements over existing methods. The project with code and data is available at https://lupantech.github.io/inter-gps.
翻译:最近,国家定位方案社区对几何问题的解决引起了很大关注。任务具有挑战性,因为它需要抽象的对问题的理解和具有非同质知识的象征性推理。然而,目前的数据集不是规模小,就是不能公开提供。因此,我们建造了一个新的大型基准,即几何3K,由3 002个正式语言加密集注释的几何问题组成。我们进一步提议采用新的几何方法,采用正式语言和象征性推理,称为可解释的几何问题解答器(Inter-GPS)。GPS首先通过基于规则的文本解析和神经物体探测,将问题文本和图表自动化为正式语言。与现有方法的隐性学习不同,Inter-GPS将理论知识作为有条件的规则,并一步一步地进行象征性推理。此外,一个标注预测器旨在推断向更高效、更合理的搜索路径的象征性解答器提供的方应用序列。关于GPS和GOS数据集的广泛实验表明,In-GPS分别通过基于规则的文本解析和神经物体探测,与现有方法有显著的改进。在http://ubsterps。