This paper describes the design and implementation of a new machine learning model for online learning systems.We aim at improving the intelligent level of the systems by enabling an automated math word problem solver which can support a wide range of functions such as homework correction, difficulty estimation, and priority recommendation. We originally planned to employ existing models but realized that they processed a math word problem as a sequence or a homogeneous graph of tokens. Relationships between the multiple types of tokens such as entity, unit, rate, and number were ignored.We decided to design and implement a novel model to use such relational data to bridge the information gap between human-readable language and machine-understandable logical form. We propose a heterogeneous line graph transformer (HLGT) model that constructs a heterogeneous line graph via semantic role labeling on math word problems and then perform node representation learning aware of edge types. We add numerical comparison as an auxiliary task to improve model training for real-world use. Experimental results show that the proposed model achieves a better performance than existing models and suggest that it is still far below human performance. Information utilization and knowledge discovery is continuously needed to improve the online learning systems.
翻译:本文描述了用于在线学习系统的新机器学习模式的设计与实施。 我们的目标是通过一个自动数学词问题解答器来提高系统的智能水平,该解答器能够支持诸如家庭作业校正、困难估计和优先建议等广泛功能。 我们最初计划使用现有模型,但认识到它们处理数学字问题时是按顺序或同质符号图解排列。 忽略了实体、单位、费率和数字等多种标志之间的关系。 我们决定设计和实施一个新的模型,以便使用这种关系数据来缩小人类可读语言和机器可理解逻辑形式之间的信息差距。 我们提议了一个多式线形图变器(HLGT)模型,通过数学字词问题标记的语义作用构建一个混杂的线条形图,然后进行不记号的学习,了解边缘类型。 我们添加数字比较作为辅助任务来改进真实世界使用的模型培训。 实验结果显示,拟议的模型比现有模型取得更好的性能,并表明它远远低于人类的性能。 信息利用和知识发现需要不断改进在线学习系统。