标题：基于回归的模块化Lagrangian方法处理Packing和Covering约束条件下的上下文并排问题摘要：我们考虑具有线性约束条件的上下文并排问题（CBwLC），这是上下文并排的一种变体，其中算法在满足对总消耗的线性约束条件下消耗多个资源。该问题推广了具有背包约束条件的上下文并排（CBwK），允许进行Packing和Covering约束条件的正面和负面资源消耗。我们提供了第一个基于回归预测器的CBwLC（或CBwK）算法。该算法简单、计算效率高，并且具有消失保证。对于算法必须在某个约束条件被破坏时停止的CBwK变体，其为统计上的最优算法。此外，我们提供了第一个推广至异步环境下的消失保证的CBwLC（或CBwK）算法。我们通过识别一个较弱（且可能更公平）的基准来避免先前工作中的强不可能结果。我们的算法基于LagrangianBwK（Immorlica等人，FOCS2019）和SquareCB（Foster和Rakhlin，ICML2020）技术，利用了两个技术的内在模块化特性。 (Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression)

翻译：标题：基于回归的模块化Lagrangian方法处理Packing和Covering约束条件下的上下文并排问题摘要：我们考虑具有线性约束条件的上下文并排问题（CBwLC），这是上下文并排的一种变体，其中算法在满足对总消耗的线性约束条件下消耗多个资源。该问题推广了具有背包约束条件的上下文并排（CBwK），允许进行Packing和Covering约束条件的正面和负面资源消耗。我们提供了第一个基于回归预测器的CBwLC（或CBwK）算法。该算法简单、计算效率高，并且具有消失保证。对于算法必须在某个约束条件被破坏时停止的CBwK变体，其为统计上的最优算法。此外，我们提供了第一个推广至异步环境下的消失保证的CBwLC（或CBwK）算法。我们通过识别一个较弱（且可能更公平）的基准来避免先前工作中的强不可能结果。我们的算法基于LagrangianBwK（Immorlica等人，FOCS2019）和SquareCB（Foster和Rakhlin，ICML2020）技术，利用了两个技术的内在模块化特性。

Aleksandrs Slivkins,Karthik Abinav Sankararaman,Dylan J. Foster

We consider contextual bandits with linear constraints (CBwLC), a variant of contextual bandits in which the algorithm consumes multiple resources subject to linear constraints on total consumption. This problem generalizes contextual bandits with knapsacks (CBwK), allowing for packing and covering constraints, as well as positive and negative resource consumption. We provide the first algorithm for CBwLC (or CBwK) that is based on regression oracles. The algorithm is simple, computationally efficient, and admits vanishing regret. It is statistically optimal for the variant of CBwK in which the algorithm must stop once some constraint is violated. Further, we provide the first vanishing-regret guarantees for CBwLC (or CBwK) that extend beyond the stochastic environment. We side-step strong impossibility results from prior work by identifying a weaker (and, arguably, fairer) benchmark to compare against. Our algorithm builds on LagrangeBwK (Immorlica et al., FOCS 2019), a Lagrangian-based technique for CBwK, and SquareCB (Foster and Rakhlin, ICML 2020), a regression-based technique for contextual bandits. Our analysis leverages the inherent modularity of both techniques.

翻译：