We consider the goodness-of fit testing problem for H\"older smooth densities over $\mathbb{R}^d$: given $n$ iid observations with unknown density $p$ and given a known density $p_0$, we investigate how large $\rho$ should be to distinguish, with high probability, the case $p=p_0$ from the composite alternative of all H\"older-smooth densities $p$ such that $\|p-p_0\|_t \geq \rho$ where $t \in [1,2]$. The densities are assumed to be defined over $\mathbb{R}^d$ and to have H\"older smoothness parameter $\alpha>0$. In the present work, we solve the case $\alpha \leq 1$ and handle the case $\alpha>1$ using an additional technical restriction on the densities. We identify matching upper and lower bounds on the local minimax rates of testing, given explicitly in terms of $p_0$. We propose novel test statistics which we believe could be of independent interest. We also establish the first definition of an explicit cutoff $u_B$ allowing us to split $\mathbb{R}^d$ into a bulk part (defined as the subset of $\mathbb{R}^d$ where $p_0$ takes only values greater than or equal to $u_B$) and a tail part (defined as the complementary of the bulk), each part involving fundamentally different contributions to the local minimax rates of testing.
翻译:Hölder连续密度的拟合优度检验:锐利的局部极小极小化率
翻译后的摘要:
我们考虑 Hölder光滑密度的拟合优度检验问题。给定一个未知密度 $p$ 和一个已知密度 $p_0$,假设都是在 $\mathbb{R}^d$ 上定义的。我们研究如何选择 $\rho$,才能在高概率下将 $p=p_0$ 与 $p$ 为所有 Hölder 光滑密度,且满足 $\|p-p_0\|_t \geq \rho$,其中 $t \in [1,2]$,以及 $p$ 满足 Hölder 平滑参数 $\alpha>0$,这两种情况加以区分。在本研究中,我们解决了 $\alpha \leq 1$ 的情况,并使用了对密度的额外技术限制来处理 $\alpha>1$ 的情况。我们在给定 $p_0$ 的情况下,确定了关于测试的局部极小极小化率的匹配上限和下限,并以 $p_0$ 为显式函数的形式给出。我们提出了一种新的测试统计量,我们认为这可能是值得独立探究的。我们还建立了第一个明确的截止值 $u_B$ 的定义,从而将 $\mathbb{R}^d$ 划分为一个主体部分(定义为 $p_0$ 只取大于或等于 $u_B$ 的值的 $\mathbb{R}^d$ 的子集)和一个尾部分(定义为与主体互补),每个部分对于检验的局部极小极小化率都有根本不同的贡献。