The increasing availability of curated citation data provides a wealth of resources for analyzing and understanding the intellectual influence of scientific publications. In the field of statistics, current studies of citation data have mostly focused on the interactions between statistical journals and papers, limiting the measure of influence to mainly within statistics itself. In this paper, we take the first step towards understanding the impact statistics has made on other scientific fields in the era of Big Data. By collecting comprehensive bibliometric data from the Web of Science database for selected statistical journals, we investigate the citation trends and compositions of citing fields over time to show that their diversity has been increasing. Furthermore, we use the local clustering technique involving personalized PageRank with conductance for size selection to find the most relevant statistical research area for a given external topic of interest. We provide theoretical guarantees for the procedure and, through a number of case studies, show the results from our citation data align well with our knowledge and intuition about these external topics. Overall, we have found that the statistical theory and methods recently invented by the statistics community have made increasing impact on other scientific fields.
翻译:在统计领域,目前对引证数据的研究主要侧重于统计期刊和论文之间的相互作用,将影响力的衡量限制在统计本身的范围内。在本文件中,我们迈出了第一步,以了解统计数据在大数据时代对其他科学领域的影响。我们从科学网络数据库收集了用于选定统计期刊的综合二元数据,从而调查了引用领域的引言趋势和构成,以表明其多样性一直在增加。此外,我们利用由个人化的PecelRank和进行规模选择的本地集束技术,为某一外部感兴趣的专题寻找最相关的统计研究领域。我们从理论上保证程序,并通过一些案例研究,表明我们的引证数据的结果与我们对这些外部专题的了解和直觉非常吻合。总体而言,我们发现统计界最近发明的统计理论和方法对其他科学领域的影响越来越大。