Constrained Stochastic Nonconvex Optimization with State-dependent Markov Data

November 9, 2022

Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi

From arXiv, 2 figures

We study stochastic optimization algorithms for constrained nonconvex stochastic optimization problems with Markovian data. In particular, we focus on the case when the transition kernel of the Markov chain is state-dependent. Such stochastic optimization problems arise in various machine learning problems including strategic classification and reinforcement learning. For this problem, we study both projection-based and projection-free algorithms. In both cases, we establish that the number of calls to the stochastic first-order oracle to obtain an appropriately defined $\epsilon$-stationary point is of the order $\mathcal{O}(1/\epsilon^{2.5})$. In the projection-free setting we additionally establish that the number of calls to the linear minimization oracle is of order $\mathcal{O}(1/\epsilon^{5.5})$. We also empirically demonstrate the performance of our algorithm on the problem of strategic classification with neural networks.
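The difference between the projection-based and projection-free updates mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the constraint sets (an L2 ball for the projected step, an L1 ball for the linear minimization oracle), the quadratic toy loss, the step sizes, and the helper `next_sample` (a hypothetical Markov kernel whose drift depends on the current iterate) are all assumptions chosen only to keep the example short and runnable.

```python
import math
import random


def project_l2_ball(x, radius=1.0):
    """Euclidean projection onto {x : ||x||_2 <= radius}."""
    norm = math.sqrt(sum(v * v for v in x))
    if norm <= radius:
        return list(x)
    return [v * radius / norm for v in x]


def lmo_l1_ball(grad, radius=1.0):
    """Linear minimization oracle: a minimizer of <grad, s> over ||s||_1 <= radius."""
    i = max(range(len(grad)), key=lambda j: abs(grad[j]))
    s = [0.0] * len(grad)
    if grad[i] != 0:
        s[i] = -radius if grad[i] > 0 else radius
    return s


def projected_step(x, grad, step, radius=1.0):
    """Projection-based update: gradient step, then project back onto the set."""
    moved = [xi - step * gi for xi, gi in zip(x, grad)]
    return project_l2_ball(moved, radius)


def frank_wolfe_step(x, grad, step, radius=1.0):
    """Projection-free update: convex combination with the LMO output."""
    s = lmo_l1_ball(grad, radius)
    return [(1.0 - step) * xi + step * si for xi, si in zip(x, s)]


def next_sample(z, x, rng, rho=0.5, shift=0.1):
    """Hypothetical state-dependent kernel: the next data point depends on x."""
    return [rho * zi + (1.0 - rho) * shift * xi + 0.1 * rng.gauss(0.0, 1.0)
            for zi, xi in zip(z, x)]


def run(num_iters=200, dim=3, seed=0, projection_free=False):
    """Toy loop on the loss 0.5 * ||x - z||^2 with iterate-dependent Markov data."""
    rng = random.Random(seed)
    x = [0.0] * dim
    z = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    for k in range(1, num_iters + 1):
        z = next_sample(z, x, rng)                    # data drawn given current x
        grad = [xi - zi for xi, zi in zip(x, z)]      # stochastic gradient
        step = 1.0 / (k + 1)
        if projection_free:
            x = frank_wolfe_step(x, grad, step)
        else:
            x = projected_step(x, grad, step)
    return x
```

Calling `run()` uses the projected update and `run(projection_free=True)` uses the oracle-based update; in both cases the iterate stays inside its constraint set. The projection-free variant never computes a projection, which is why its cost is counted in calls to the linear minimization oracle.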

