Introduction: When a learner fails to reach a milestone, educators often wonder if there had been any warning signs that could have allowed them to intervene sooner. Machine learning is used to predict which students are at risk of failing a national certifying exam. Predictions are made well in advance of the exam, such that educators can meaningfully intervene before students take the exam. Methods: Using already-collected, first-year student assessment data from four cohorts in a Master of Physician Assistant Studies program, the authors implement an "adaptive minimum match" version of the k-nearest neighbors algorithm (AMMKNN), using changing numbers of neighbors to predict each student's future exam scores on the Physician Assistant National Certifying Examination (PANCE). Leave-one-out cross validation (LOOCV) was used to evaluate the practical capabilities of this model, before making predictions for new students. Results: The best predictive model has an accuracy of 93%, sensitivity of 69%, and specificity of 94%. It generates a predicted PANCE score for each student, one year before they are scheduled to take the exam. Students can then be prospectively categorized into groups that need extra support, optional extra support, or no extra support. The educator then has one year to provide the appropriate customized support to each type of student. Conclusions: Predictive analytics can help health professions educators allocate scarce time and resources across their students. Interprofessional educators can use the included methods and code to generate predicted test outcomes for students. The authors recommend that educators using this or similar predictive methods act responsibly and transparently.
翻译:当一个学习者未能达到一个里程碑时,教育者往往会怀疑是否有任何预兆可以让他们更快地干预。机器学习被用来预测哪些学生可能面临国家认证考试失败的风险。在考试前很早就作出预测,这样教育者就可以在学生参加考试前进行有意义的干预。方法:使用物理学助学硕士课程中四个组群已经收集的第一年学生评估数据,作者们会使用K-近邻算法的“适应性最低匹配”版本,使用变化的邻居数来预测每个学生未来考试成绩。学生们在考试前提前预测。放假一课的交叉验证(LOOCV)用来评估这一模式的实际能力,然后对新学生进行预测。结果:最佳预测模型的准确度为93%,敏感度为69%,具体度为94%。它为每个学生制作了预测的Pance评分,在他们预定进行考试前的一年里,他们可以预测每个学生们未来考试分得的成绩。学生们可以选择一种选择的预选方法,然后可以将预选的成绩分类为每个学生们的成绩。