The increasing adoption of technology to augment or even replace traditional face-to-face learning has led to the development of a myriad of tools and platforms aimed at engaging the students and facilitating the teacher's ability to present new information. The IMapBook project aims at improving the literacy and reading comprehension skills of elementary school-aged children by presenting them with interactive e-books and letting them take part in moderated book discussions. This study aims to develop and illustrate a machine learning-based approach to message classification that could be used to automatically notify the discussion moderator of a possible need for an intervention and also to collect other useful information about the ongoing discussion. We aim to predict whether a message posted in the discussion is relevant to the discussed book, whether the message is a statement, a question, or an answer, and in which broad category it can be classified. We incrementally enrich our used feature subsets and compare them using standard classification algorithms as well as the novel Feature stacking method. We use standard classification performance metrics as well as the Bayesian correlated t-test to show that the use of described methods in discussion moderation is feasible. Moving forward, we seek to attain better performance by focusing on extracting more of the significant information found in the strong temporal interdependence of the messages.
翻译:越来越多地采用技术来扩大甚至取代传统的面对面学习,这导致开发了各种工具和平台,旨在使学生参与进来,促进教师提供新信息的能力。IMapBook项目旨在通过向小学学龄儿童提供互动电子书,让他们参加经协调的图书讨论,提高他们的识字和阅读理解技能。本研究旨在开发和说明一种基于机械的基于学习的信息分类方法,可用来自动通知讨论主持人可能有必要进行干预,并收集关于正在进行的讨论的其他有用信息。我们的目标是预测讨论中公布的信息是否与讨论的书有关,该信息是否是一个声明、问题或答案,以及可以将其分类的广义类别。我们逐步丰富我们所使用的子集,并用标准分类算法和新颖的地貌堆叠法比较它们。我们使用标准分类性业绩指标和巴耶斯相关测试,以显示在讨论中使用所述方法是可行的。我们力求通过更注重获取重要信息,从而实现更好的相互依存性。我们力求通过更深入地提取发现的重要信息,从而实现更好的相互依存性。