Human-in-the-loop topic modelling incorporates users' knowledge into the modelling process, enabling them to refine the model iteratively. Recent research has demonstrated the value of user feedback, but there are still issues to consider, such as the difficulty in tracking changes, comparing different models and the lack of evaluation based on real-world examples of use. We developed a novel, interactive human-in-the-loop topic modeling system with a user-friendly interface that enables users compare and record every step they take, and a novel topic words suggestion feature to help users provide feedback that is faithful to the ground truth. Our system also supports not only what traditional topic models can do, i.e., learning the topics from the whole corpus, but also targeted topic modelling, i.e., learning topics for specific aspects of the corpus. In this article, we provide an overview of the system and present the results of a series of user studies designed to assess the value of the system in progressively more realistic applications of topic modelling.
翻译:人机协同主题模型将用户的知识融合到建模过程中,使其能够迭代地完善模型。近期的研究表明了用户反馈的价值,但仍需考虑一些问题,如跟踪更改的难度、比较不同的模型以及缺乏基于真实应用案例的评估。我们开发了一个新颖的、交互式人机协同主题模型系统,具有用户友好的界面,能够使用户比较和记录他们所采取的每一个步骤,并提供一个新颖的主题词建议功能,以帮助用户提供忠实于真实情况的反馈。我们的系统不仅支持传统主题模型所能做到的从整个语料库学习主题,而且还支持针对特定语料库方面进行的主题建模。本文介绍了该系统概述,并介绍了一系列用户研究的结果,旨在逐步评估主题建模在更加真实的应用中的价值。