Artificial intelligence and semantic technologies are evolving and have been applied in various research areas, including the education domain. Higher Education institutions strive to improve students' academic performance. Early intervention to at-risk students and a reasonable curriculum is vital for students' success. Prior research opted for deploying traditional machine learning models to predict students' performance. In terms of curriculum semantic analysis, after conducting a comprehensive systematic review regarding the use of semantic technologies in the Computer Science curriculum, a major finding of the study is that technologies used to measure similarity have limitations in terms of accuracy and ambiguity in the representation of concepts, courses, etc. To fill these gaps, in this study, three implementations were developed, that is, to predict students' performance using marks from the previous semester, to model a course representation in a semantic way and compute the similarity, and to identify the prerequisite between two similar courses. Regarding performance prediction, we used the combination of Genetic Algorithm and Long-Short Term Memory (LSTM) on a dataset from a Brazilian university containing 248730 records. As for similarity measurement, we deployed BERT to encode the sentences and used cosine similarity to obtain the distance between courses. With respect to prerequisite identification, TextRazor was applied to extract concepts from course description, followed by employing SemRefD to measure the degree of prerequisite between two concepts. The outcomes of this study can be summarized as: (i) a breakthrough result improves Manrique's work by 2.5% in terms of accuracy in dropout prediction; (ii) uncover the similarity between courses based on course description; (iii) identify the prerequisite over three compulsory courses of School of Computing at ANU.
翻译:人工智能和语义技术正在不断发展,并应用于包括教育领域在内的各种研究领域。高等教育机构努力提高学生的学术表现。高等教育机构努力提高学生的学术表现。早期干预风险学生和合理课程对于学生的成功至关重要。先前的研究选择了使用传统的机器学习模型来预测学生的成绩。在课程语义分析方面,在对计算机科学课程中使用语义技术进行全面系统审查之后,研究的一项主要结论是,测量相似性的技术在概念、课程等的表述的准确性和模糊性方面受到限制。为填补这些差距,本研究中,开发了三个课程,即利用上半学期的标记来预测学生的成绩,以语义方式模拟课程的学习模式,对类似课程的成绩进行计算。在绩效预测方面,我们使用遗传Algorithm和Long-Sortimmerial(LSTM)的组合,用来改进巴西大学含有248730个课程的清晰度的数据集。关于相似性的测量,在课程中,我们使用BERTRE(R)在远程理解概念中采用两个术语的缩略性定义,在使用Semreditional 上,在使用两部的解算中可以使用。