决策依赖性分发的斯托克散装点问题 (Stochastic Saddle Point Problems with Decision-Dependent Distributions)

This paper focuses on stochastic saddle point problems with decision-dependent distributions in both the static and time-varying settings. These are problems whose objective is the expected value of a stochastic payoff function, where random variables are drawn from a distribution induced by a distributional map. For general distributional maps, the problem of finding saddle points is in general computationally burdensome, even if the distribution is known. To enable a tractable solution approach, we introduce the notion of equilibrium points -- which are saddle points for the stationary stochastic minimax problem that they induce -- and provide conditions for their existence and uniqueness. We demonstrate that the distance between the two classes of solutions is bounded provided that the objective has a strongly-convex-strongly-concave payoff and Lipschitz continuous distributional map. We develop deterministic and stochastic primal-dual algorithms and demonstrate their convergence to the equilibrium point. In particular, by modeling errors emerging from a stochastic gradient estimator as sub-Weibull random variables, we provide error bounds in expectation and in high probability that hold for each iteration; moreover, we show convergence to a neighborhood in expectation and almost surely. Finally, we investigate a condition on the distributional map -- which we call opposing mixture dominance -- that ensures the objective is strongly-convex-strongly-concave. Under this assumption, we show that primal-dual algorithms converge to the saddle points in a similar fashion.

翻译：本文侧重于在静态和时间变化环境中基于决定的分布的悬浮马鞍问题。这些问题的客观目标是随机性报酬功能的预期值, 随机变量来自分布式地图的分布。对于一般分布式地图, 寻找马鞍点的问题一般是计算繁琐的, 即使分布是已知的。为了能够实现一个可移动的解决方案, 我们引入平衡点的概念, 它们是固定性随机微缩轴问题的缓冲点, 并为它们的存在和独特性提供条件。我们展示了两种相似的解决方案类型之间的距离, 只要目标具有很强的混固性强的混凝固性报酬支付和利普施奇茨连续分布式地图。我们开发了确定性和随机性定值的初步算法, 并展示了它们与平衡点的趋同。特别是, 通过模拟从固定性初切性梯度估测点产生的误差, 我们提供了它们的存在和独特性的条件。我们展示了两种相似性方法之间的距离, 只要该目标具有强烈的精确性, 就会显示每张定的稳定的市值分布。