We develop a novel procedure for estimating the optimizer of general convex stochastic optimization problems of the form $\min_{x\in\mathcal{X}} \mathbf{E}[F(x,\xi)]$, when the given data is a finite independent sample selected according to $\xi$. The procedure is based on a median-of-means tournament, and is the first procedure that exhibits the optimal statistical performance in heavy tailed situations: we recover the asymptotic rates dictated by the central limit theorem in a non-asymptotic manner once the sample size exceeds some explicitly computable threshold. Additionally, our results apply in the high-dimensional setup, as the threshold sample size exhibits the optimal dependence on the dimension (up to a logarithmic factor). The general setting allows us to recover recent results on multivariate mean estimation and linear regression in heavy-tailed situations and to prove the first sharp, non-asymptotic results for the portfolio optimization problem.
翻译:当给定数据是按x1美元选择的有限独立样本时,我们开发了一种新程序,用以估计以$\min ⁇ x\in\mathcal{X ⁇ \\\\\\mathbf{E}[F(x,\xxxxxxx}}}美元为形式的普通混凝土优化问题的最佳优化。该程序以中值比赛为基础,也是在重尾情况下显示最佳统计性能的第一个程序:一旦样本大小超过某些明确的可计算阈值,我们就以非简单方式以非简单方式恢复中央限标定的无现时速率。此外,我们的结果适用于高维设置,因为阈值样本大小显示对尺寸的最佳依赖性(最高为对数系数 ) 。 总体设置使我们能够在重尾尾情况下恢复关于多变平均值估计和线性回归的最新结果,并证明组合优化问题的第一个尖锐、非被动的结果。