As one of the challenging NLP tasks, designing math word problem (MWP) solvers has attracted increasing research attention for the past few years. In previous work, models designed by taking into account the properties of the binary tree structure of mathematical expressions at the output side have achieved better performance. However, the expressions corresponding to a MWP are often diverse (e.g., $n_1+n_2 \times n_3-n_4$, $n_3\times n_2-n_4+n_1$, etc.), and so are the corresponding binary trees, which creates difficulties in model learning due to the non-deterministic output space. In this paper, we propose the Structure-Unified M-Tree Coding Solver (SUMC-Solver), which applies a tree with any M branches (M-tree) to unify the output structures. To learn the M-tree, we use a mapping to convert the M-tree into the M-tree codes, where codes store the information of the paths from tree root to leaf nodes and the information of leaf nodes themselves, and then devise a Sequence-to-Code (seq2code) model to generate the codes. Experimental results on the widely used MAWPS and Math23K datasets have demonstrated that SUMC-Solver not only outperforms several state-of-the-art models under similar experimental settings but also performs much better under low-resource conditions.
翻译:作为具有挑战性的 NLP 任务之一, 设计数学词问题( MWP) 解答器在过去几年中引起了越来越多的研究关注。 在以往的工作中, 以考虑到输出方数学表达面数学二树结构特性而设计的模型取得了较好的性能。 但是, 与 MWP 相对应的表达式往往多种多样( 例如, $_ 1+n_ 2\ 2\ times n_ 3- n_ 4\ times n_ 3- n_ 3\ times n_ 2n_ 4+n_ 4+n_ 1 美元等), 相应的二进制树也引起了越来越多的研究关注。 在本文中, 我们建议了“ 结构统一的 M- 调解析器” (SUMMC- Solver), 将带有任何 M 分支( M- tree) 的树加以整合。 为了学习 M- tree, 我们使用一个映射图将 M- tree 模式转换成 M- tree 模式, 其中代码存储了从树根到叶节节节节节节点的路径信息, 以及类似叶节规则本身, 然后将使用SMAC 的 格式的模型进行更精确的计算结果。