Knowledge tracing (KT) aims to leverage students' learning histories to estimate their mastery levels on a set of pre-defined skills, based on which the corresponding future performance can be accurately predicted. In practice, a student's learning history comprises answers to sets of massed questions, each known as a session, rather than merely being a sequence of independent answers. Theoretically, within and across these sessions, students' learning dynamics can be very different. Therefore, how to effectively model the dynamics of students' knowledge states within and across the sessions is crucial for handling the KT problem. To address the above issue, we propose a novel hierarchical transformer model, named HiTSKT, comprising an interaction(-level) encoder to capture the knowledge a student acquires within a session, and a session(-level) encoder to summarise acquired knowledge across the past sessions. To predict an interaction in the current session, a knowledge retriever integrates the summarised past-session knowledge with the previous interactions' information into proper knowledge representations. Additionally, to model the student's long-term forgetting behaviour across the sessions, a power-law-decay attention mechanism is designed and deployed in the session encoder. Extensive experiments demonstrate that HiTSKT achieves new state-of-the-art performance on three public datasets, compared with six state-of-the-art baselines.
翻译:知识追踪(KT)的目的是利用学生的学习历史,根据一套预先界定的技能估计他们的掌握水平,据此可以准确预测相应的未来业绩。在实践中,学生的学习历史包括一系列大规模问题的答案,每个问题被称为一个会议,而不仅仅是一个独立答案的顺序。理论上,在这些课程之内和之间,学生的学习动态可能大不相同。因此,如何有效地模拟学生在课程内和课程之间的知识状态动态,对于处理KT问题至关重要。为了解决上述问题,我们建议了一个新的等级变压器模型,名为HITSKT, 包括一个互动(级别)编码器,以捕捉学生在课程中获得的知识,每个会议称为一个会议,而不仅仅是一个会议顺序(级别)的编码器,以总结以往课程中获得的知识。为了预测在本届会议中的互动,一个知识检索器将汇总的上学知识与以往互动的信息的动态纳入适当的知识陈述中。此外,为了模拟学生在课程中长期忘记行为,一个权力-法律-决定器编码器编码器,包括一个互动(级别)编码器,以捕捉取学生在课程中获得的知识,然后进行三州的数据实验。