Establishing the limiting distribution of Chatterjee's rank correlation for a general, possibly non-independent, pair of random variables has been eagerly awaited to many. This paper shows that (a) Chatterjee's rank correlation is asymptotically normal as long as one variable is not a measurable function of the other, (b) the corresponding asymptotic variance is uniformly bounded by 36, and (c) a consistent variance estimator exists. Similar results also hold for Azadkia-Chatterjee's graph-based correlation coefficient, a multivariate analogue of Chatterjee's original proposal. The proof is given by appealing to H\'ajek representation and Chatterjee's nearest-neighbor CLT.
翻译:许多人急切地等待着对普通的、可能非独立的随机变量进行随机变量对查特杰等级相关性的有限分布。 本文显示 (a) 只要一个变量不是另一个变量的可测量功能,查特杰的等级相关性就无异于常态, (b) 相应的非现性差异一致地受36个变量的制约, (c) 始终存在差异估计器。 Azadkia-Chatterjee基于图表的关联系数也有类似结果,这是查特杰最初提案的多变量类比。 证据来自H\'ajek 和Chatterjee的近邻CLT。