Procrastination, the irrational delay of tasks, is a common occurrence in online learning. Potential negative consequences include higher risk of drop-outs, increased stress, and reduced mood. Due to the rise of learning management systems and learning analytics, indicators of such behavior can be detected, enabling predictions of future procrastination and other dilatory behavior. However, research focusing on such predictions is scarce. Moreover, studies involving different types of predictors and comparisons between the predictive performance of various methods are virtually non-existent. In this study, we aim to fill these research gaps by analyzing the performance of multiple machine learning algorithms when predicting the delayed or timely submission of online assignments in a higher education setting with two categories of predictors: subjective, questionnaire-based variables and objective, log-data based indicators extracted from a learning management system. The results show that models with objective predictors consistently outperform models with subjective predictors, and a combination of both variable types perform slightly better. For each of these three options, a different approach prevailed (Gradient Boosting Machines for the subjective, Bayesian multilevel models for the objective, and Random Forest for the combined predictors). We conclude that careful attention should be paid to the selection of predictors and algorithms before implementing such models in learning management systems.
翻译:由于学习管理系统和学习分析的上升,可以检测到这种行为的指标,从而能够预测未来的拖延和其他拖延行为。然而,以这种预测为重点的研究很少。此外,涉及不同类型预测器的研究和不同方法预测性业绩的比较几乎不存在。在本研究中,我们的目标是通过分析多种机器学习算法的性能来填补这些研究差距,方法是在高等教育环境中预测延迟或及时提交在线任务时分析多种机器学习算法的性能,这种在线任务由两类预测器组成:主观的、基于问卷的变量和客观的、基于日志的指标从学习管理系统提取出来。结果显示,目标预测器的模型始终优于与主观预测器的成型模型,两种变量的组合效果则略好。对于这三种选择中每一种选择,我们采取了不同的做法(在客观、基于问卷的变量和客观的变量的基础上,以及基于日志的模型的基础上,在对目标进行认真的预测之前,我们应先对主观的多层次模型进行高级推导算,然后对结果进行精确的系统进行预测。