Automatic text analysis methods, such as Topic Modelling, are gaining much attention in the Humanities. However, scholars need extensive coding skills to use such methods appropriately, and this need for technical expertise prevents their broad adoption in Humanities research. In this paper, to help Humanities scholars with no or limited coding skills use Topic Modelling, we introduce MITAO, a web-based tool that allows the definition of a visual workflow embedding various automatic text analysis operations. MITAO also allows one to store and share both the workflow and the results of its execution with other researchers, which enables the reproducibility of the analysis. We present an example application of Topic Modelling with MITAO using a collection of English abstracts of the articles published in "Umanistica Digitale". The results returned by MITAO are shown with dynamic web-based visualizations, which allowed us to gain preliminary insights into the evolution over time of the topics treated in the articles published in "Umanistica Digitale". All the results, along with the defined workflows, are published and accessible for further studies.