Entropy integrals are widely used as a powerful empirical process tool to obtain upper bounds for the rates of convergence of global empirical risk minimizers (ERMs), in standard settings such as density estimation and regression. The upper bound for the convergence rates thus obtained typically matches the minimax lower bound when the entropy integral converges, but admits a strict gap compared to the lower bound when it diverges. Birg\'e and Massart [BM93] provided a striking example showing that such a gap is real with the entropy structure alone: for a variant of the natural H\"older class with low regularity, the global ERM actually converges at the rate predicted by the entropy integral that substantially deviates from the lower bound. The counter-example has spawned a long-standing negative position on the use of global ERMs in the regime where the entropy integral diverges, as they are heuristically believed to converge at a sub-optimal rate in a variety of models. The present paper demonstrates that this gap can be closed if the models admit certain degree of `set structures' in addition to the entropy structure. In other words, the global ERMs in such set structured models will indeed be rate-optimal, matching the lower bound even when the entropy integral diverges. The models with set structures we investigate include (i) image and edge estimation, (ii) binary classification, (iii) multiple isotonic regression, (iv) $s$-concave density estimation, all in general dimensions when the entropy integral diverges. Here set structures are interpreted broadly in the sense that the complexity of the underlying models can be essentially captured by the size of the empirical process over certain class of measurable sets, for which matching upper and lower bounds are obtained to facilitate the derivation of sharp convergence rates for the associated global ERMs.
翻译:在密度估计和回归等标准设置中,作为强大的实验性最小化全球实验风险最小化(ERMS)趋同率率的上限值被广泛用作获得全球实验风险最小化(ERMs)趋同率的上限值的强大实验过程工具。因此获得的趋同率的上限值通常与微小负下限值的下限值相匹配,但承认了与分差时较低约束值相比的严格差距。Birg\'e和Massart[BM93]提供了一个引人注目的例子,表明这种差距仅与进化结构存在真实性:对于自然的H\"老化者类别(ERMs)的变异性值,全球机构正值实际上与从下限差值组合值组合值所预测的比率相趋同。反倍数的反比值使得全球机构在使用全球机构风险管理机制中长期处于负状态,因为其内分差值(因为它们以超值相信在各种模型中的次均值比率趋一致,因此, 本文表明,如果模型承认某种“固定结构”的正值结构,则全球的正值结构的正位值值结构的内变差值值值值(在结构中,则会算值结构中,则会算算值中,则会算算值的数值结构的更低值的数值结构的数值结构中将包含某些值的数值值的数值值。