We present an adaptive learning Intelligent Tutoring System, which uses model-based reinforcement learning in the form of contextual bandits to assign learning activities to students. The model is trained on the trajectories of thousands of students in order to maximize their exercise completion rates and continues to learn online, automatically adjusting itself to new activities. A randomized controlled trial with students shows that our model leads to superior completion rates and significantly improved student engagement when compared to other approaches. Our approach is fully-automated unlocking new opportunities for learning experience personalization.
翻译:我们推出一个适应性学习智能导师系统,该系统以背景强盗的形式使用基于模型的强化学习,为学生分配学习活动,该模型在数千名学生的轨迹上接受培训,以便最大限度地提高他们的练习完成率,并继续在线学习,自动适应新的活动。一个针对学生的随机控制的试验显示,与其它方法相比,我们的模型导致较高的结业率,并大大提高学生的参与程度。 我们的方法是完全自动化地释放学习个人化经验的新机会。