专题模型评价专题覆盖方法 (A Topic Coverage Approach to Evaluation of Topic Models)

from arxiv, Results and contributions unchanged; Added new references; Improved the contextualization and the description of the work (abstr, intro, 7.1 concl, rw, concl); Moved technical details of data and model building to appendices; Improved layout;

Topic models are widely used unsupervised models of text capable of learning topics - weighted lists of words and documents - from large collections of text documents. When topic models are used for discovery of topics in text collections, a question that arises naturally is how well the model-induced topics correspond to topics of interest to the analyst. In this paper we revisit and extend a so far neglected approach to topic model evaluation based on measuring topic coverage - computationally matching model topics with a set of reference topics that models are expected to uncover. The approach is well suited for analyzing models' performance in topic discovery and for large-scale analysis of both topic models and measures of model quality. We propose new measures of coverage and evaluate, in a series of experiments, different types of topic models on two distinct text domains for which interest for topic discovery exists. The experiments include evaluation of model quality, analysis of coverage of distinct topic categories, and the analysis of the relationship between coverage and other methods of topic model evaluation. The contributions of the paper include new measures of coverage, insights into both topic models and other methods of model evaluation, and the datasets and code for facilitating future research of both topic coverage and other approaches to topic model evaluation.

翻译：专题模型被广泛使用而不受监督的文本模型,能够从大量文本文件收集中学习专题 -- -- 加权的文字和文件清单 -- -- 大量文本文件;当专题模型用于在文本收集中发现专题时,自然产生的一个问题是,模型引起的专题与分析员感兴趣的专题对应的程度如何;在本文件中,我们重新审视并推广迄今为止被忽略的基于专题覆盖范围衡量的专题模型评价方法 -- -- 计算将模型专题与模型预期发现的一套参考专题相匹配;该方法非常适合分析专题发现模型的绩效,并大规模分析专题模型和模型质量的衡量标准;我们提出在一系列实验中,在专题发现感兴趣的两个不同不同的文本领域,对不同类型专题模型进行覆盖和评估的新措施;实验包括评估模型质量、分析不同专题类别的范围,以及分析专题模型评价覆盖面和其他方法之间的关系;该文件的贡献包括新的覆盖范围计量、对专题模型模型和其他模式评价方法的深入了解,以及便利今后对专题范围进行研究的数据集和代码。