In this paper, we consider online continuous DR-submodular maximization with linear stochastic long-term constraints. Compared to prior work on online submodular maximization, our setting introduces the extra complication of stochastic linear constraint functions that are i.i.d. generated at each round. To be precise, at step $t\in\{1,\dots,T\}$, a DR-submodular utility function $f_t(\cdot)$ and a constraint vector $p_t$, i.i.d. generated from an unknown distribution with mean $p$, are revealed after committing to an action $x_t$. The goal is to maximize the overall utility while keeping the expected cumulative resource consumption $\sum_{t=1}^T \langle p,x_t\rangle$ below a fixed budget $B_T$. Stochastic long-term constraints arise naturally in applications where a limited budget or resource is available and the resource consumption at each step is governed by a stochastically time-varying environment. We propose the Online Lagrangian Frank-Wolfe (OLFW) algorithm to solve this class of online problems. We analyze the performance of the OLFW algorithm and obtain sub-linear regret bounds as well as sub-linear cumulative constraint violation bounds, both in expectation and with high probability.
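To make the setting concrete, the following is a minimal illustrative sketch of a Lagrangian Frank-Wolfe loop of the kind the abstract describes: each round performs a Frank-Wolfe linear-maximization step on the utility gradient penalized by a priced resource cost, followed by a dual-ascent update on the budget violation. The box feasible set $[0,1]^d$, the gradient oracle `grad_f`, and the dual step size `eta` are illustrative assumptions, not details taken from the paper; the actual OLFW algorithm and its step sizes are specified in the paper itself.

```python
import random

def olfw_sketch(grad_f, p_samples, B_T, T, d, eta=0.1):
    """Illustrative sketch of an Online Lagrangian Frank-Wolfe loop.

    Assumptions (not from the paper): the feasible set is the box
    [0,1]^d, grad_f(t, x) returns the gradient of the round-t utility
    f_t at x, and eta is a hypothetical dual step size.
    """
    x = [0.0] * d          # current action
    lam = 0.0              # Lagrange multiplier for the budget constraint
    total_spend = 0.0      # realized cumulative resource consumption
    for t in range(T):
        p_t = p_samples[t]                 # stochastic constraint vector
        g = grad_f(t, x)                   # gradient feedback for f_t
        # Penalized linear objective: utility gradient minus priced cost.
        score = [g[i] - lam * p_t[i] for i in range(d)]
        # Frank-Wolfe linear maximization over the box [0,1]^d:
        # the maximizer is a vertex, coordinate-wise 1 where score > 0.
        v = [1.0 if s > 0 else 0.0 for s in score]
        # Convex-combination step toward the vertex v.
        gamma = 1.0 / (t + 2)
        x = [(1 - gamma) * xi + gamma * vi for xi, vi in zip(x, v)]
        spend = sum(pi * xi for pi, xi in zip(p_t, x))
        total_spend += spend
        # Dual ascent on the per-round budget violation.
        lam = max(0.0, lam + eta * (spend - B_T / T))
    return x, lam, total_spend
```

The dual variable `lam` rises when per-round spending exceeds the per-round budget share $B_T/T$ and decays toward zero otherwise, which is what drives the cumulative constraint violation to grow sub-linearly in such schemes.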