We consider the classical problems of estimating the mean of an $n$-dimensional normally (with identity covariance matrix) or Poisson distributed vector under the squared loss. In a Bayesian setting the optimal estimator is given by the prior-dependent conditional mean. In a frequentist setting various shrinkage methods were developed over the last century. The framework of empirical Bayes, put forth by Robbins (1956), combines Bayesian and frequentist mindsets by postulating that the parameters are independent but with an unknown prior and aims to use a fully data-driven estimator to compete with the Bayesian oracle that knows the true prior. The central figure of merit is the regret, namely, the total excess risk over the Bayes risk in the worst case (over the priors). Although this paradigm was introduced more than 60 years ago, little is known about the asymptotic scaling of the optimal regret in the nonparametric setting. We show that for the Poisson model with compactly supported and subexponential priors, the optimal regret scales as $\Theta((\frac{\log n}{\log\log n})^2)$ and $\Theta(\log^3 n)$, respectively, both attained by the original estimator of Robbins. For the normal mean model, the regret is shown to be at least $\Omega((\frac{\log n}{\log\log n})^2)$ and $\Omega(\log^2 n)$ for compactly supported and subgaussian priors, respectively, the former of which resolves the conjecture of Singh (1979) on the impossibility of achieving bounded regret; before this work, the best regret lower bound was $\Omega(1)$. %Analogous results for subgaussian or subexponential priors are also obtained. In addition to the empirical Bayes setting, these results are shown to hold in the compound setting where the parameters are deterministic. As a side application, the construction in this paper also leads to improved or new lower bounds for mixture density estimation.
翻译:我们考虑了通常估算美元维度平均值( 带有身份共变矩阵) 或 Poisson 分布矢量值在平方损失下的典型问题。 在巴伊西亚设置时, 以先前依赖的有条件平均值给出了最佳估测符。 在常年设置时, 开发了各种缩水方法。 Robbins (1956年) 推出的经验性贝斯框架, 结合了巴伊西亚和常态心态, 假设参数是独立的, 但之前未知的参数, 目的是使用完全由数据驱动的估测仪来与了解真实损失的Bayesian 更低的调量 。 在巴伊西亚设置时, 最差的度估算值( 比以往的还要高 ) 。 虽然这个模型在60多年前被引入了。 在非相对偏差的环境下, 经验性思维模式是: 精度的模型, 在精度支持和亚亚亚亚基亚基值的模型, 最优的排序是 以美元为最低值 (\ remax) 。 在正数的正数中, 之前的计算结果 。