We develop a novel procedure for estimating the optimizer of general convex stochastic optimization problems of the form $\min_{x\in\mathcal{X}} \mathbb{E}[F(x,\xi)]$, when the given data is a finite independent sample selected according to $\xi$. The procedure is based on a median-of-means tournament, and is the first procedure that exhibits the optimal statistical performance in heavy tailed situations: we recover the asymptotic rates dictated by the central limit theorem in a non-asymptotic manner once the sample size exceeds some explicitly computable threshold. Additionally, our results apply in the high-dimensional setup, as the threshold sample size exhibits the optimal dependence on the dimension (up to a logarithmic factor). The general setting allows us to recover recent results on multivariate mean estimation and linear regression in heavy-tailed situations and to prove the first sharp, non-asymptotic results for the portfolio optimization problem.
翻译:当给定数据是按美元=xx,\xxxx]选择的有限独立样本时,我们开发了一种新程序,用以估计以美元为单位($min=%x\in\mathcal{X}{X ⁇ \\mathb{E}}[F(x,\xxxxx}}}}美元为单位(F(F)(F)x,\xxxxx}}E}美元为单位(F(F)x,xxxxx}}})的普通混凝土优化问题的最佳优化。该程序基于中位的中位量比赛,也是在严重尾尾端情况中显示最佳统计性表现的第一个程序:一旦样本大小超过某些明确的可计算阈值,我们就会以非非简单方式恢复中央限的无药效率。此外,我们的结果将应用在高维设置中,因为阈值的样本大小显示对维度的最佳依赖度(最多为对数系数因素) 。 总体设置允许我们恢复在严重尾尾量情况下的多变平均值估计和线性回归的最新结果,并证明组合优化问题的第一尖化、非随机结果。