The pursuit of knowledge is the permanent goal of human beings. Scientific literature, as the major medium that carries knowledge between scientists, exhibits explosive growth during the last century. Despite the frequent use of many tangible measures, such as citation, impact factor and g-index, to quantify the influence of papers from different perspectives based on scientific productivity, it has not yet been well understood how the relationship between scientific productivity and knowledge amount turns out to be, i.e., how the knowledge value of papers and knowledge amount vary with development of the discipline. This raises the question of whether high scientific productivity equals large knowledge amount. Here, building on rich literature on academic conferences and journals, we collect 185 million articles covering 19 disciplines published during 1970 to 2020, and establish citation network research area to represent the knowledge flow from the authors of the article being cited to the authors of the articles that cite it under each specific area. As a result, the structure formed during the evolution of each scientific area can implicitly tells how the knowledge flows between nodes and how it behaves as the number of literature (productivity) increases. By leveraging Structural entropy in structured high-dimensional space and Shannon entropy in unstructured probability space, we propose the Quantitative Index of Knowledge (KQI), which is taken as the subtraction between the two types of entropy, to reflect the extent of disorder difference (knowledge amount) caused by structure (order). With the aid of KQI, we find that, although the published literature shows an explosive growth, the amount of knowledge (KQI) contained in it obviously slows down, and there is a threshold after which the growth of knowledge accelerates...
翻译:科学文献作为科学家之间知识传播的主要媒介,在上个世纪中呈现爆炸性的增长。尽管我们经常使用许多有形措施,如引文、影响系数和g-index,从科学生产力的不同角度量化论文的影响,但人们尚未充分理解科学生产力和知识之间的关系是如何变化的,即:随着学科的发展,文件和知识的知识价值和知识数额如何不同。这提出了高科学生产力是否等于大量知识水平的问题。在这里,根据学术会议和期刊的丰富文献,我们收集了涵盖1970至2020年期间出版的19个学科的1.85亿篇文章,并建立了引文网络研究领域,以代表文章作者根据科学生产力和知识的不同角度对论文的影响。结果,每个科学领域演变过程中形成的结构可以隐含知识如何在节点之间流动,以及知识如何随着文学(生产力)数量的增长而变化。我们通过利用结构化的空间空间和期刊结构结构结构结构化和香农读文研究所的文献数量来加速知识流 。我们用知识水平来反映数字性Q 和数字化数据结构化的数值,我们用数字来显示数字化的数值的数值。