This paper describes the design and implementation of a new machine learning model for online learning systems. We aim at improving the intelligent level of the systems by enabling an automated math word problem solver which can support a wide range of functions such as homework correction, difficulty estimation, and priority recommendation. We originally planned to employ existing models but realized that they processed a math word problem as a sequence or a homogeneous graph of tokens. Relationships between the multiple types of tokens such as entity, unit, rate, and number were ignored. We decided to design and implement a novel model to use such relational data to bridge the information gap between human-readable language and machine-understandable logical form. We propose a heterogeneous line graph transformer (HLGT) model that constructs a heterogeneous line graph via semantic role labeling on math word problems and then perform node representation learning aware of edge types. We add numerical comparison as an auxiliary task to improve model training for real-world use. Experimental results show that the proposed model achieves a better performance than existing models and suggest that it is still far below human performance. Information utilization and knowledge discovery is continuously needed to improve the online learning systems.
翻译:本文描述了用于在线学习系统的新机器学习模式的设计与实施。我们的目标是通过一个自动数学词问题解答器来提高系统的智能水平,该解答器能够支持诸如家庭作业校正、困难估计和优先建议等广泛功能。我们最初计划使用现有模型,但认识到它们处理数学字问题时是按顺序或同质符号图解排列。忽略了实体、单位、费率和数字等多种标志之间的关系。我们决定设计和实施一个新的模型,以使用这种关系数据来缩小人类可读语言和机器可理解逻辑形式之间的信息差距。我们提议了一个多式线形图变器(HLGT)模型,通过数学字词问题标识的语义作用构建一个多式线形图,然后进行不偏差的学习。我们添加数字比较作为辅助任务,以改进实体使用的模式培训。实验结果显示,拟议的模型比现有模型的性能要好得多,并建议它远远低于人类的性能。信息利用和知识发现需要不断改进在线学习系统。