This note is a short description of TeCoMiner, an interactive tool for exploring the topic content of text collections. Unlike other topic modeling tools, TeCoMiner is not based on some generative probabilistic model but on topological considerations about co-occurrence networks of terms. We outline the methods used for identifying topics, describe the features of the tool, and sketch an application, using a corpus of policy related scientific news on environmental issues published by the European Commission over the last decade.
翻译:本说明简短地描述了TeCoMiner(TeCoMiner),这是一个探讨文本集内容的交互式工具,与其他主题模型工具不同,TeCoMiner并非基于某种基因概率模型,而是基于对共同出现术语网络的从理论上考虑。我们概述了用于确定专题的方法,描述了工具的特征,并用欧洲联盟委员会在过去十年中发表的有关环境问题的一系列与政策有关的科学新闻,绘制了应用图。