The predict-then-optimize framework is fundamental in practical stochastic decision-making problems: first predict unknown parameters of an optimization model, then solve the problem using the predicted values. A natural loss function in this setting is defined by measuring the decision error induced by the predicted parameters, which was named the Smart Predict-then-Optimize (SPO) loss by Elmachtoub and Grigas [arXiv:1710.08005]. Since the SPO loss is typically nonconvex and possibly discontinuous, Elmachtoub and Grigas [arXiv:1710.08005] introduced a convex surrogate, called the SPO+ loss, that importantly accounts for the underlying structure of the optimization model. In this paper, we greatly expand upon the consistency results for the SPO+ loss provided by Elmachtoub and Grigas [arXiv:1710.08005]. We develop risk bounds and uniform calibration results for the SPO+ loss relative to the SPO loss, which provide a quantitative way to transfer the excess surrogate risk to excess true risk. By combining our risk bounds with generalization bounds, we show that the empirical minimizer of the SPO+ loss achieves low excess true risk with high probability. We first demonstrate these results in the case when the feasible region of the underlying optimization problem is a polyhedron, and then we show that the results can be strengthened substantially when the feasible region is a level set of a strongly convex function. We perform experiments to empirically demonstrate the strength of the SPO+ surrogate, as compared to standard $\ell_1$ and squared $\ell_2$ prediction error losses, on portfolio allocation and cost-sensitive multi-class classification problems.
翻译:预测时最佳化框架对于实际的随机决策问题至关重要:首先预测优化模型的未知参数,然后用预测值解决问题。在这一背景下,自然损失功能通过测量预测参数引起的决定错误来界定。预测参数称为Smart 预测时-即时-优化 Elmachtoub和Grigas[arXiv:17100.08005]的损失。由于SPO损失通常不是Convx,而且可能在很大程度上不具有可行性,Elmachtoub和Grigas[arXiv:171.0088005]。由于SPO损失通常不是美元,而且可能具有透明度, Elmachtoub和Grigas[arXiv:171.0008005]。由于SPO损失通常不是美元,Elmachtoub和Grigas[arXiv:171.00805] 损失通常没有风险,Elmachtold 变价和变价值损失与SPO损失相对,这为将超值的超值风险转移到超值的超值的超值超值超值超值超值的SPO+损失提供了量化方法。将SPO+l=lationlevorlevorlevill1,我们的风险与Smarxxxxx 的概率展示了这些风险的概率值的概率值的概率展示,当我们以显示为总风险的概率,我们总的概率展示的概率展示。