In this work, we study contextual strongly convex simulation optimization and adopt an "optimize then predict" (OTP) approach for real-time decision making. In the offline stage, simulation optimization is conducted across a set of covariates to approximate the optimal-solution function; in the online stage, decisions are obtained by evaluating this approximation at the observed covariate. The central theoretical challenge is to understand how the inexactness of solutions generated by simulation-optimization algorithms affects the optimality gap, which is overlooked in existing studies. To address this, we develop a unified analysis framework that explicitly accounts for both solution bias and variance. Using Polyak-Ruppert averaging SGD as an illustrative simulation-optimization algorithm, we analyze the optimality gap of OTP under four representative smoothing techniques: $k$ nearest neighbor, kernel smoothing, linear regression, and kernel ridge regression. We establish convergence rates, derive the optimal allocation of the computational budget $Γ$ between the number of design covariates and the per-covariate simulation effort, and demonstrate the convergence rate can approximately achieve $Γ^{-1}$ under appropriate smoothing technique and sample-allocation rule. Finally, through a numerical study, we validate the theoretical findings and demonstrate the effectiveness and practical value of the proposed approach.
翻译:本文研究上下文强凸仿真优化问题,并采用“先优化后预测”方法进行实时决策。在离线阶段,通过一组协变量进行仿真优化以逼近最优解函数;在线阶段,则通过观测协变量评估该逼近函数来获得决策。核心理论挑战在于理解仿真优化算法产生的非精确解如何影响最优性差距——这一关键问题在现有研究中被忽视。为此,我们构建了一个统一分析框架,显式地考虑解偏差与方差的影响。以Polyak-Ruppert平均随机梯度下降法作为示例仿真优化算法,我们分析了四种典型平滑技术下OTP的最优性差距:$k$近邻法、核平滑、线性回归与核岭回归。我们建立了收敛速率,推导了计算预算$Γ$在设计协变量数量与单协变量仿真量之间的最优分配方案,并证明在适当的平滑技术与样本分配规则下,收敛速率可近似达到$Γ^{-1}$。最后通过数值实验验证了理论结果,并证明了所提方法的有效性与实用价值。