Derivation in physics, in the form of derivation reconstruction of published results, is expensive and difficult to automate, not least because the use of mathematics by physicists is less formal than that of mathematicians. Following demand for informal mathematical datasets, we describe a dataset creation method where we consider a derivation agent as a finite state machine which exists in equational states represented by strings, where transitions can occur through a combination of string operations that mimic mathematics, and defined computer algebra operations. We present the novel dataset PhysAI-DS1 generated by this method, which consists of a curated derivation of a contemporary condensed matter physics result reconstructed using a computer algebra system. We define an equation reconstruction task based on formulating derivation segments as basic units of non-trivial state sequences, with the goal of reconstructing an unknown intermediate state equivalent to one-hop inference, extensible to the multi-hop case. We present a symbolic similarity-based heuristic approach to solve an equation reconstruction task on the PhysAI-DS1 dataset, which employs a set of actions, a knowledge base of symbols and equations, and a computer algebra system, to reconstruct an unknown intermediate state within a sequence of three equational states, grouped together as a derivation unit. Informal derivation comprehension of contemporary results is an important step towards the comprehension and automation of modern physics reasoners.
翻译:在对非正式数学数据集的需求之后,我们描述了一种数据集创建方法,我们把一种衍生剂视为一种有限的国家机器,存在于由字符串代表的等式状态中,通过模拟数学的字符串操作和界定计算机代数操作,可以实现过渡。我们介绍了由这种方法产生的新型数据集PhysAI-DS1, 其中包括利用计算机代数系统重建的当代浓缩物质物理结果的精密衍生结果。我们根据对非正式数学代数数据集的需求,我们描述了一种数据集创建方法,将这种衍生剂视为一种有限的国家机器,存在于以字符串为代表的方形状态中,通过模拟数学和定义的计算机代数操作组合进行过渡。我们介绍了一种象征性的基于超异性的方法,用以解决PhysAI-DS1数据集的方程重建任务,利用计算机代数系统重建现代精密物理物理物理物理结果,将一套不相近的计算公式和数学代数序列中等式系统用于一个不相近的计算机代数序列,这是一套对等式的模型和等式结构的模型。