A margin-free measure of bivariate association generalizing Spearman's rho to the case of non-monotonic dependence is defined in terms of two square integrable functions on the unit interval. Properties of generalized Spearman correlation are investigated when the functions are piecewise continuous and strictly monotonic, with particular focus on the special cases where the functions are drawn from orthonormal bases defined by Legendre polynomials and cosine functions. For continuous random variables, generalized Spearman correlation is treated as a copula-based measure and shown to depend on a pair of uniform-distribution-preserving (udp) transformations determined by the underlying functions. Bounds for generalized Spearman correlation are derived and a novel technique referred to as stochastic inversion of udp transformations is used to construct singular copulas that attain the bounds and parametric copulas with densities that interpolate between the bounds and model different degrees of non-monotonic dependence. Sample analogues of generalized Spearman correlation are proposed and their asymptotic and small-sample properties are investigated. Potential applications of the theory are demonstrated including: exploratory analyses of the dependence structures of datasets and their symmetries; elicitation of functions maximizing generalized Spearman correlation via expansions in orthonormal basis functions; and construction of tractable probability densities to model a wide variety of non-monotonic dependencies.
翻译:本文定义了一种无边际的双变量关联测度,将斯皮尔曼ρ推广至非单调相依情形,该测度基于单位区间上的两个平方可积函数。当函数为分段连续且严格单调时,研究了广义斯皮尔曼相关系数的性质,特别聚焦于函数取自勒让德多项式和余弦函数定义的正交基的特殊情形。对于连续随机变量,广义斯皮尔曼相关系数被视为基于连接函数的测度,并证明其依赖于由底层函数确定的一对均匀分布保持变换。推导了广义斯皮尔曼相关系数的边界,并采用一种称为均匀分布保持变换随机逆的新技术,构建了达到边界的奇异连接函数以及具有密度函数的参数连接函数,这些密度函数在边界之间插值并建模不同程度的非单调相依性。提出了广义斯皮尔曼相关系数的样本类比,并研究了其渐近性质与小样本性质。展示了该理论的潜在应用,包括:数据集的相依结构及其对称性的探索性分析;通过正交基函数展开来推导最大化广义斯皮尔曼相关系数的函数;以及构建易于处理的概率密度以建模多种非单调相依模式。