This is full length article (draft version) where problem number of topics in Topic Modeling is discussed. We proposed idea that Renyi and Tsallis entropy can be used for identification of optimal number in large textual collections. We also report results of numerical experiments of Semantic stability for 4 topic models, which shows that semantic stability play very important role in problem topic number. The calculation of Renyi and Tsallis entropy based on thermodynamics approach.
翻译:这是整个条款(草案文本),讨论专题模型中的问题数目,我们提出可以使用Renyi和Tsallis entropy来确定大文本集的最佳数目,我们还报告了4个专题模型的语义稳定性数字实验结果,这表明语义稳定性在问题专题编号中起着非常重要的作用。根据热动力学方法计算Renyi和Tsallis entropy。