Research on the construction of traditional information science methodology taxonomy is mostly conducted manually. From the limited corpus, researchers have attempted to summarize some of the research methodology entities into several abstract levels (generally three levels); however, they have been unable to provide a more granular hierarchy. Moreover, updating the methodology taxonomy is traditionally a slow process. In this study, we collected full-text academic papers related to information science. First, we constructed a basic methodology taxonomy with three levels by manual annotation. Then, the word vectors of the research methodology entities were trained using the full-text data. Accordingly, the research methodology entities were clustered and the basic methodology taxonomy was expanded using the clustering results to obtain a methodology taxonomy with more levels. This study provides new concepts for constructing a methodology taxonomy of information science. The proposed methodology taxonomy is semi-automated; it is more detailed than conventional schemes and the speed of taxonomy renewal has been enhanced.
翻译:关于传统信息科学方法分类学的构建研究大多是手工进行的,从有限的研究中,研究人员试图将一些研究方法实体归纳为几个抽象层次(一般分为三个层次);然而,他们无法提供更细的层次结构;此外,更新方法分类学传统上是一个缓慢的过程;我们在这项研究中收集了与信息科学有关的全文学术论文;首先,我们通过人工注解,建立了一个具有三个层次的基本方法分类学;然后,利用全文数据对研究方法实体的文字矢量进行了培训;因此,对研究方法实体进行了分组,并扩大了基本方法分类学的范围,以获得具有更高层次的方法分类学;这一研究为构建信息科学的方法分类学提供了新的概念;拟议的方法分类学是半自动化的;比常规计划更为详细,更新分类学的速度有所提高。