与斯托卡限制:零约束违规和强盗反馈 (Online Convex Optimization with Stochastic Constraints: Zero Constraint Violation and Bandit Feedback) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 约束 · 优化器 · 在线 · CASE ·

2023 年 1 月 26 日

Online Convex Optimization with Stochastic Constraints: Zero Constraint Violation and Bandit Feedback

翻译：与斯托卡限制:零约束违规和强盗反馈

Yeongjong Kim,Dabeen Lee

This paper studies online convex optimization with stochastic constraints. We propose a variant of the drift-plus-penalty algorithm that guarantees $O(\sqrt{T})$ expected regret and zero constraint violation, after a fixed number of iterations, which improves the vanilla drift-plus-penalty method with $O(\sqrt{T})$ constraint violation. Our algorithm is oblivious to the length of the time horizon $T$, in contrast to the vanilla drift-plus-penalty method. This is based on our novel drift lemma that provides time-varying bounds on the virtual queue drift and, as a result, leads to time-varying bounds on the expected virtual queue length. Moreover, we extend our framework to stochastic-constrained online convex optimization under two-point bandit feedback. We show that by adapting our algorithmic framework to the bandit feedback setting, we may still achieve $O(\sqrt{T})$ expected regret and zero constraint violation, improving upon the previous work for the case of identical constraint functions. Numerical results demonstrate our theoretical results.

翻译：本文在网上研究“ 软盘优化” 和“ 软盘限制” 。我们提出一个替代的“ 软盘- 软盘- 软盘” 算法, 保证在固定的迭代次数后, 将预期的遗憾和零约束违反额($O)( sqrt{T}) 用于改善香草漂流- 软盘方法($O)(sqrt{T}) 的违反。我们的算法与香草流- 软盘- 软盘方法相反, 忽略了时间范围($T) 。这是基于我们的新颖的“ 漂流 Lemma ” 算法, 提供了虚拟队列漂移的时间轮圈, 从而导致虚拟队列长度的反常线。此外, 我们扩展了我们的框架, 在两点带宽的反馈下, 以随机调节的在线锥盘优化。我们通过调整我们的算法框架来适应“ 硬盘反馈设置 ”, 我们仍可以实现“ $(\ qrt{T) 的预期的“ 硬盘和零约束 ” 。

0

相关内容

赌博机/老虎机

赌博机/老虎机

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

63+阅读 · 2023年2月15日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

南大《优化方法（Optimization Methods》课程，推荐！

南大《优化方法（Optimization Methods》课程，推荐！

专知会员服务

80+阅读 · 2022年4月3日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Berezin变换及相关的算子理论

国家自然科学基金

1+阅读 · 2014年12月31日

偕二氟取代Combretastatins衍生物的设计与合成

国家自然科学基金

0+阅读 · 2014年12月31日

ATRP改性壳聚糖农药载体的设计及控制释放性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

Trx对鸡心肌细胞能量代谢的影响

国家自然科学基金

0+阅读 · 2013年12月31日

具有状态约束的Navier-Stokes方程的最优控制问题

国家自然科学基金

0+阅读 · 2013年12月31日

Periostin蛋白在乳腺癌转移前微环境中的功能及作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

临近空间浮空器流-固-热耦合动力学模型与飞行机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

糖酵解在APC-Cdh1调控缺血后星形胶质细胞反应性增殖中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

转录因子GATA-4介导EGF信号调控心肌能量代谢及在心肌肥大中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

CyclinE/Cdk2相关蛋白Ankrd17在细胞周期调控中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Gyroid-like metamaterials: Topology optimization and Deep Learning

Arxiv

0+阅读 · 2023年3月17日

Diffusing the Optimal Topology: A Generative Optimization Approach

Arxiv

0+阅读 · 2023年3月17日

Optimal Volume-Sensitive Bounds for Polytope Approximation

Arxiv

0+阅读 · 2023年3月16日

Numerical modelling of wave propagation phenomena in thermo-poroelastic media via discontinuous Galerkin methods

Arxiv

0+阅读 · 2023年3月16日

A Stochastic Sequential Quadratic Optimization Algorithm for Nonlinear Equality Constrained Optimization with Rank-Deficient Jacobians

Arxiv

0+阅读 · 2023年3月16日

Randomized Kaczmarz method with adaptive stepsizes for inconsistent linear systems

Arxiv

0+阅读 · 2023年3月16日

Sequential Gaussian Processes for Online Learning of Nonstationary Functions

Arxiv

0+阅读 · 2023年3月16日

Multi-Robot Persistent Monitoring: Minimizing Latency and Number of Robots with Recharging Constraints

Arxiv

0+阅读 · 2023年3月15日

A Bregman-Kaczmarz method for nonlinear systems of equations

Arxiv

0+阅读 · 2023年3月15日

A scaling-invariant algorithm for linear programming whose running time depends only on the constraint matrix

Arxiv

0+阅读 · 2023年3月15日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

63+阅读 · 2023年2月15日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

南大《优化方法（Optimization Methods》课程，推荐！

南大《优化方法（Optimization Methods》课程，推荐！

专知会员服务

80+阅读 · 2022年4月3日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Gyroid-like metamaterials: Topology optimization and Deep Learning

Arxiv

0+阅读 · 2023年3月17日

Diffusing the Optimal Topology: A Generative Optimization Approach

Arxiv

0+阅读 · 2023年3月17日

Optimal Volume-Sensitive Bounds for Polytope Approximation

Arxiv

0+阅读 · 2023年3月16日

Numerical modelling of wave propagation phenomena in thermo-poroelastic media via discontinuous Galerkin methods

Arxiv

0+阅读 · 2023年3月16日

A Stochastic Sequential Quadratic Optimization Algorithm for Nonlinear Equality Constrained Optimization with Rank-Deficient Jacobians

Arxiv

0+阅读 · 2023年3月16日

Randomized Kaczmarz method with adaptive stepsizes for inconsistent linear systems

Arxiv

0+阅读 · 2023年3月16日

Sequential Gaussian Processes for Online Learning of Nonstationary Functions

Arxiv

0+阅读 · 2023年3月16日

Multi-Robot Persistent Monitoring: Minimizing Latency and Number of Robots with Recharging Constraints

Arxiv

0+阅读 · 2023年3月15日

A Bregman-Kaczmarz method for nonlinear systems of equations

Arxiv

0+阅读 · 2023年3月15日

A scaling-invariant algorithm for linear programming whose running time depends only on the constraint matrix

Arxiv

0+阅读 · 2023年3月15日

相关基金

Berezin变换及相关的算子理论

国家自然科学基金

1+阅读 · 2014年12月31日

偕二氟取代Combretastatins衍生物的设计与合成

国家自然科学基金

0+阅读 · 2014年12月31日

ATRP改性壳聚糖农药载体的设计及控制释放性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

Trx对鸡心肌细胞能量代谢的影响

国家自然科学基金

0+阅读 · 2013年12月31日

具有状态约束的Navier-Stokes方程的最优控制问题

国家自然科学基金

0+阅读 · 2013年12月31日

Periostin蛋白在乳腺癌转移前微环境中的功能及作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

临近空间浮空器流-固-热耦合动力学模型与飞行机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

糖酵解在APC-Cdh1调控缺血后星形胶质细胞反应性增殖中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

转录因子GATA-4介导EGF信号调控心肌能量代谢及在心肌肥大中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

CyclinE/Cdk2相关蛋白Ankrd17在细胞周期调控中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员