最终到最终预测和优化流程的风险保障 (Risk Guarantees for End-to-End Prediction and Optimization Processes)

Prediction models are often employed in estimating parameters of optimization models. Despite the fact that in an end-to-end view, the real goal is to achieve good optimization performance, the prediction performance is measured on its own. While it is usually believed that good prediction performance in estimating the parameters will result in good subsequent optimization performance, formal theoretical guarantees on this are notably lacking. In this paper, we explore conditions that allow us to explicitly describe how the prediction performance governs the optimization performance. Our weaker condition allows for an asymptotic convergence result, while our stronger condition allows for exact quantification of the optimization performance in terms of the prediction performance. In general, verification of these conditions is a non-trivial task. Nevertheless, we show that our weaker condition is equivalent to the well-known Fisher consistency concept from the learning theory literature. This then allows us to easily check our weaker condition for several loss functions. We also establish that the squared error loss function satisfies our stronger condition. Consequently, we derive the exact theoretical relationship between prediction performance measured with the squared loss, as well as a class of symmetric loss functions, and the subsequent optimization performance. In a computational study on portfolio optimization, fractional knapsack and multiclass classification problems, we compare the optimization performance of using of several prediction loss functions (some that are Fisher consistent and some that are not) and demonstrate that lack of consistency of the loss function can indeed have a detrimental effect on performance.

翻译：预测模型往往用于估计优化模型的参数。尽管从端到端,真正的目标是实现最佳业绩,但预测业绩是自行衡量的。虽然通常认为在估计参数方面的良好预测业绩将带来良好的随后最佳业绩,但这方面的正式理论保障明显缺乏。在本文件中,我们探索了允许我们明确描述预测业绩如何调节优化业绩的条件。我们的弱项条件允许得出一个零乐观的趋同结果,而我们的强条件允许精确量化预测业绩的最佳业绩。总的来说,对这些条件的核查是非三重性任务。尽管如此,我们通常认为,在估计参数方面,良好的预测业绩将会导致良好的随后最佳业绩,但通常认为在估计参数方面,良好的预测业绩将会导致良好的最佳业绩,这使我们很容易检查若干损失功能的较弱状况。我们还确定,平方错误损失功能满足了我们的较强的条件。因此,我们从准确的假设业绩与平方损失之间的理论关系,以及从对损失功能的分类中得出准确的量化,以及随后的优化业绩是非三重性的工作。然而,我们表明,我们的较弱状况的劣性状况相当于数种最坏的机能的计算学分损失等级的功能,我们比较了多种最差损失等级的模型的计算和多级的亏损等级的计算性能的计算,的确的计算和不力的计算性能的计算性能的计算性能的计算性能是,对错。