Classical asymptotic theory for statistical inference usually involves calibrating a statistic by fixing the dimension $d$ while letting the sample size $n$ increase to infinity. Recently, much effort has been dedicated towards understanding how these methods behave in high-dimensional settings, where $d_n$ and $n$ both increase to infinity together at some prescribed relative rate. This often leads to different inference procedures, depending on the assumptions about the dimensionality, leaving the practitioner in a bind: given a dataset with 100 samples in 20 dimensions, should they calibrate by assuming $n \gg d$, or $d_n/n \approx 0.2$? This paper considers the goal of dimension-agnostic inference -- developing methods whose validity does not depend on any assumption on $d_n$. We introduce a new, generic approach that uses variational representations of existing test statistics along with sample splitting and self-normalization to produce a new test statistic with a Gaussian limiting distribution. The resulting statistic can be viewed as a careful modification of degenerate U-statistics, dropping diagonal blocks and retaining off-diagonals. We exemplify our technique for a handful of classical problems including one-sample mean and covariance testing. Our tests are shown to have minimax rate-optimal power against appropriate local alternatives, and without explicitly targeting the high-dimensional setting their power is optimal up to a $\sqrt 2$ factor. A hidden advantage is that our proofs are simple and transparent. We end by describing several fruitful open directions.
翻译:典型的统计推断理论通常涉及通过确定维度来校准统计,确定维度值美元,同时让样本规模增加至无限值。最近,我们投入了大量精力来理解这些方法在高维环境中的运行方式,在高维环境中,美元和美元两者均以某种规定的相对速率提高至无限度。这往往导致不同的推论程序,这取决于对维度的假设,使从业者处于一种约束状态:如果有一个具有100个样本的20维度的数据集,它们使用美元=gg d$,或美元_n/n\approx 0.2美元?本文考虑了这些方法在高维度环境中如何运行,而美元和美元两者均不取决于对美元的任何假设。我们采用了一种新的、通用的方法,将现有测试统计数据与样本的分解和自我标准化结合起来,从而产生一种新的测试,而高比值为20维维度的分布。由此得出的统计可以被看作是一种谨慎地修改的U-statrical-alal-al-al-al-al-alviews real resental extial extial rodutional rodutional rogrational rodutional rodustration rodutional rodustration rodududustrational rodustrismal exmal roduction ex ex) a ex ex ex