The Wilcoxon rank-sum test is one of the most popular distribution-free procedures for testing the equality of two univariate probability distributions. One of the main reasons for its popularity can be attributed to the remarkable result of Hodges and Lehmann (1956), which shows that the asymptotic relative efficiency of Wilcoxon's test with respect to Student's $t$-test, under location alternatives, never falls below 0.864, despite the former being exactly distribution-free for all sample sizes. Even more striking is the result of Chernoff and Savage (1958), which shows that the efficiency of a Gaussian score transformed Wilcoxon's test, against the $t$-test, is lower bounded by 1. In this paper we study the two-sample problem in the multivariate setting and propose distribution-free analogues of the Hotelling $T^2$ test (the natural multidimensional counterpart of Student's $t$-test) based on optimal transport and obtain extensions of the above celebrated results over various natural families of multivariate distributions. Our proposed tests are consistent against a general class of alternatives and satisfy Hodges-Lehmann and Chernoff-Savage-type efficiency lower bounds, despite being entirely agnostic to the underlying data generating mechanism. In particular, a collection of our proposed tests suffer from no loss in asymptotic efficiency, when compared to Hotelling $T^2$. To the best of our knowledge, these are the first collection of multivariate, nonparametric, exactly distribution-free tests that provably achieve such attractive efficiency lower bounds. We also demonstrate the broader scope of our methods in optimal transport based nonparametric inference by constructing exactly distribution-free multivariate tests for mutual independence, which suffer from no loss in asymptotic efficiency against the classical Wilks' likelihood ratio test, under Konijn alternatives.
翻译:Wilcoxon 类比测试是测试两种单体概率分布平等性的最受欢迎的无分配比率程序之一。 其受欢迎性的主要原因之一是Hodges和Lehmann(1956年)的显著结果,这表明Wilcoxon的测试对学生的美元测试相对效率在地点选项下从未低于0. 864, 尽管前者在所有样本尺寸上完全没有分配。 更引人注目的是Chernoff和Savage(1958年)的结果, 这表明高斯分的效益改变了Wilcoxon的测试, 而不是美元标准(1956年), 这表明Hodges和Lehmann(1956年)的测试结果显示, Wilcoxon的测试在多变量设置上存在两个模样性问题, 并提议在2美元测试中采用无分配的类比值( 学生的自然多维数相对值测试) 。 在最优的运输中,我们最优的替代的替代品在多体比值分布的自然家族中得到了更新。 我们的测试显示, 最优的测试在普通测试中也持续地展示了我们最低的等级的测试, 。