Analysis of acknowledgments is particularly interesting as acknowledgments may give information not only about funding, but they are also able to reveal hidden contributions to authorship and the researcher's collaboration patterns, context in which research was conducted, and specific aspects of the academic work. The focus of the present research is the analysis of a large sample of acknowledgement texts indexed in the Web of Science (WoS) Core Collection. Record types 'article' and 'review' from four different scientific domains, namely social sciences, economics, oceanography and computer science, published from 2014 to 2019 in a scientific journal in English were considered. Six types of acknowledged entities, i.e., funding agency, grant number, individuals, university, corporation and miscellaneous, were extracted from the acknowledgement texts using a Named Entity Recognition (NER) tagger and subsequently examined. A general analysis of the acknowledgement texts showed that indexing of funding information in WoS is incomplete. The analysis of the automatically extracted entities revealed differences and distinct patterns in the distribution of acknowledged entities of different types between different scientific domains. A strong association was found between acknowledged entity and scientific domain and acknowledged entity and entity type. Only negligible correlation was found between the number of citations and the number of acknowledged entities. Generally, the number of words in the acknowledgement texts positively correlates with the number of acknowledged funding organizations, universities, individuals and miscellaneous entities. At the same time, acknowledgement texts with the larger number of sentences have more acknowledged individuals and miscellaneous categories.
翻译:对承认的分析特别令人感兴趣,因为承认不仅可以提供有关资金的信息,而且还能够揭示对作者和研究人员的合作模式、开展研究的背景以及学术工作的具体方面作出的隐蔽贡献。本研究的重点是对科学(WoS)核心文献网络索引的确认文本的大量样本进行分析;记录四个不同的科学领域,即社会科学、经济、海洋学和计算机科学,从2014年至2019年在一份英文科学期刊上发表,它们也能够揭示对作者和研究人员合作模式的隐蔽贡献;六类公认的实体,即供资机构、赠款数目、个人、大学、公司和杂项,是从确认文件中摘录出来的,使用一个名列实体识别(NER)塔格尔,随后对其进行审查;对确认文件的总体分析表明,将社会科学(WoS)核心文献中的资金信息指数化不完全;对自动提取的实体的分析揭示了不同科学领域不同类型实体分布的差异和不同模式;发现公认的实体与科学领域和公认实体及实体类型之间有着强有力的联系;在确认的文件中,仅以微不足道的相对关系,在确认的索引和确认的篇幅中,在确认实体的数目中,还确认了实体的索引和确认的编号。