We consider the classical problems of estimating the mean of an $n$-dimensional normally (with identity covariance matrix) or Poisson distributed vector under the squared loss. In a Bayesian setting the optimal estimator is given by the prior-dependent conditional mean. In a frequentist setting various shrinkage methods were developed over the last century. The framework of empirical Bayes, put forth by Robbins (1956), combines Bayesian and frequentist mindsets by postulating that the parameters are independent but with an unknown prior and aims to use a fully data-driven estimator to compete with the Bayesian oracle that knows the true prior. The central figure of merit is the regret, namely, the total excess risk over the Bayes risk in the worst case (over the priors). Although this paradigm was introduced more than 60 years ago, little is known about the asymptotic scaling of the optimal regret in the nonparametric setting. We show that for the Poisson model with compactly supported and subexponential priors, the optimal regret scales as $\Theta((\frac{\log n}{\log\log n})^2)$ and $\Theta(\log^3 n)$, respectively, both attained by the original estimator of Robbins. For the normal mean model, the regret is shown to be at least $\Omega((\frac{\log n}{\log\log n})^2)$ and $\Omega(\log^2 n)$ for compactly supported and subgaussian priors, respectively, the former of which resolves the conjecture of Singh (1979) on the impossibility of achieving bounded regret; before this work, the best regret lower bound was $\Omega(1)$. In addition to the empirical Bayes setting, these results are shown to hold in the compound setting where the parameters are deterministic. As a side application, the construction in this paper also leads to improved or new lower bounds for density estimation of Gaussian and Poisson mixtures.
翻译:我们考虑了通常估算美元维度值的平均值( 带有身份共变矩阵) 或 Poisson 分布矢量在平方损失下的平均值的经典问题。 在巴伊西亚设置中, 以先前依赖的有条件平均值为最佳估计符。 在常年设置中, 开发了各种缩缩缩方法。 Robbins (1956) 推出的经验性贝斯框架, 将巴伊西亚和常态思维相结合, 假设参数是独立的, 但之前未知的参数, 目的是使用完全由数据驱动的估测器与了解真实损失的Bayesesian 更低的配置进行竞争。 在巴伊西亚设置中, 最差的估算值是: 最差的风险( 比先前的还要多 ) 。 虽然这个模型在60多年前被引入了。 在非相对偏差的环境下, 最差的脊椎模型的模型是, 最短的模型是新支持的和最差的, 最差的( 相对的) 硬度是, 最差的 美元 的, 之前的 DNA 和正数的 。