关于PageRank和MarkovRank的级别统计 (On rank statistics of PageRank and MarkovRank)

An important statistic in analyzing some (finite) network data, called \emph{PageRank}, and a related new statistic, which we call \emph{MarkovRank}, are studied in this paper. The PageRank was originally developed by the cofounders of \emph{Google}, Sergey Brin and Larry Page, to optimize the ranking of websites for their search engine outcomes, and it is computed using an iterative algorithm, based on the idea that nodes with a larger number of incoming edges are more important. The aim of this paper is to analyze the common features and some significant differences between the PageRank and the new Rank. A common merit of the two Ranks is that both statistics can be easily computed by either the mathematical computation or the iterative algorithm. According to the analysis of some examples, these two statistics seem to return somewhat different values, but the resulting rank statistics of both statistics are not far away from each other. One of the differences is that only MarkovRank has the property that its rank statistic does not depend on any tuning parameter, and it is determined only through the adjacency matrix for given network data. Moreover, it is also shown that the rank statistic of MarkovRank coincides with that of the stationary distribution vector of the corresponding Markov chain (with finite state space) whenever the Markov chain is regular. Thus, MarkovRank may have a potential to play a similar role to the well established PageRank from a practical point of view, not only commonly with light computational tasks, but also with some new theoretical validity.

翻译：在分析一些(freite) 网络数据(称为 emph{ PageRank} ) 的重要统计, 以及我们称之为 emph{ MarkovRank} 的相关新统计(我们称之为 emph{ MarkovRank} ), 本文对此进行了研究。 PageRank 最初是由 emph{ Google} 、 Sergey Brin 和 Larry Page 的创始人开发的, 目的是优化网站的搜索引擎结果的排名, 并使用迭接算法进行计算, 其依据的理念是: 实际进入边缘较多的节点的节点更为重要。本文的目的是分析PageRank 和新阶点之间的共同特征和一些重大差异。两层的常见优点是, 这两种统计数据都可以通过数学计算或迭接式算法来计算。这两种统计数据似乎都具有某种不同的值, 但由此得出的等级统计数据并不相距对方很远。其中一种差异是, 只有MarkovRank, 其等级统计的属性并不取决于任何调试调的矢中值参数, 只要 R 正常的轨值的值, 就能显示一个固定的网络的分布。