自适应LQR的几乎必然的$\sqrt{T}$遗憾界 (Almost Surely $\sqrt{T}$ Regret Bound for Adaptive LQR) - 专知论文

会员服务 ·

0

几乎必然 · 控制器 · 自适应 · 工业过程 · 收敛性 ·

2023 年 4 月 18 日

Almost Surely $\sqrt{T}$ Regret Bound for Adaptive LQR

翻译：自适应LQR的几乎必然的$\sqrt{T}$遗憾界

Yiwen Lu,Yilin Mo

The Linear-Quadratic Regulation (LQR) problem with unknown system parameters has been widely studied, but it has remained unclear whether $\tilde{ \mathcal{O}}(\sqrt{T})$ regret, which is the best known dependence on time, can be achieved almost surely. In this paper, we propose an adaptive LQR controller with almost surely $\tilde{ \mathcal{O}}(\sqrt{T})$ regret upper bound. The controller features a circuit-breaking mechanism, which circumvents potential safety breach and guarantees the convergence of the system parameter estimate, but is shown to be triggered only finitely often and hence has negligible effect on the asymptotic performance of the controller. The proposed controller is also validated via simulation on Tennessee Eastman Process~(TEP), a commonly used industrial process example.

翻译：在未知系统参数的情况下，线性二次调节（LQR）问题已经得到广泛研究，但仍然不清楚是否能够几乎肯定地实现$\tilde{ \mathcal{O}}(\sqrt{T})$遗憾上限，这是迄今为止对时间最好的已知依赖度。在本文中，我们提出了一种具有几乎必然的$\tilde{ \mathcal{O}}(\sqrt{T})$遗憾上界的自适应LQR控制器。该控制器具有断路器机制，可以避免潜在的安全风险并保证系统参数估计的收敛性，但是被证明只会被触发有限的次数，因此对控制器的渐近性能几乎没有影响。通过对田纳西东曼过程（Tennessee Eastman Process，简称TEP），一个常用的工业过程例子进行仿真，证明了所提出的控制器的有效性。

0

相关内容

几乎必然

【ICML2023】基于能量模型的奖励条件强化学习的贝叶斯重参数化

【ICML2023】基于能量模型的奖励条件强化学习的贝叶斯重参数化

专知会员服务

24+阅读 · 2023年5月23日

《多智能体任务规划》2022博士论文

《多智能体任务规划》2022博士论文

专知会员服务

285+阅读 · 2022年11月20日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

专知会员服务

20+阅读 · 2020年6月11日

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

专知会员服务

12+阅读 · 2020年4月16日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

分数Brown运动驱动的随机微分方程随机分岔与遍历性的研究

国家自然科学基金

2+阅读 · 2015年12月31日

Markovian 跳变广义随机切换系统的稳定性及滑模控制与应用研究

国家自然科学基金

1+阅读 · 2014年12月31日

结构性稀疏信号的动态系统建模与恢复

国家自然科学基金

0+阅读 · 2013年12月31日

隶属度函数部分未知的T-S模糊系统有限频控制器设计

国家自然科学基金

0+阅读 · 2013年12月31日

不确定耦合PDE-ODE系统的自适应镇定

国家自然科学基金

0+阅读 · 2013年12月31日

随机跳跃系统的分析与综合

国家自然科学基金

0+阅读 · 2012年12月31日

广义受限系统的分析与优化设计

国家自然科学基金

0+阅读 · 2010年12月31日

具有SiISS逆动态的随机非线性系统的控制问题研究

国家自然科学基金

0+阅读 · 2009年12月31日

数字麦克风设计与噪声优化方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

时滞离散脉冲系统稳定、镇定与控制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Is Generative Modeling-based Stylization Necessary for Domain Adaptation in Regression Tasks?

Arxiv

0+阅读 · 2023年6月2日

Reward is enough for convex MDPs

Arxiv

0+阅读 · 2023年6月2日

Refined Regret for Adversarial MDPs with Linear Function Approximation

Arxiv

0+阅读 · 2023年6月1日

A General Framework for Equivariant Neural Networks on Reductive Lie Groups

Arxiv

0+阅读 · 2023年5月31日

DeepMerge: Deep Learning-Based Region-Merging for Image Segmentation

Arxiv

0+阅读 · 2023年5月31日

Constant or logarithmic regret in asynchronous multiplayer bandits

Arxiv

0+阅读 · 2023年5月31日

Asymptotic normality of robust risk minimizers

Arxiv

0+阅读 · 2023年5月30日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Composite Adversarial Attacks

Arxiv

12+阅读 · 2020年12月10日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

VIP会员

文章信息

相关主题

相关VIP内容

【ICML2023】基于能量模型的奖励条件强化学习的贝叶斯重参数化

【ICML2023】基于能量模型的奖励条件强化学习的贝叶斯重参数化

专知会员服务

24+阅读 · 2023年5月23日

《多智能体任务规划》2022博士论文

《多智能体任务规划》2022博士论文

专知会员服务

285+阅读 · 2022年11月20日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

【KDD2020】CAST:一种基于相关关系的多尺度数据自适应光谱聚类算法,CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

专知会员服务

20+阅读 · 2020年6月11日

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

专知会员服务

12+阅读 · 2020年4月16日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

相关论文

Is Generative Modeling-based Stylization Necessary for Domain Adaptation in Regression Tasks?

Arxiv

0+阅读 · 2023年6月2日

Reward is enough for convex MDPs

Arxiv

0+阅读 · 2023年6月2日

Refined Regret for Adversarial MDPs with Linear Function Approximation

Arxiv

0+阅读 · 2023年6月1日

A General Framework for Equivariant Neural Networks on Reductive Lie Groups

Arxiv

0+阅读 · 2023年5月31日

DeepMerge: Deep Learning-Based Region-Merging for Image Segmentation

Arxiv

0+阅读 · 2023年5月31日

Constant or logarithmic regret in asynchronous multiplayer bandits

Arxiv

0+阅读 · 2023年5月31日

Asymptotic normality of robust risk minimizers

Arxiv

0+阅读 · 2023年5月30日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Composite Adversarial Attacks

Arxiv

12+阅读 · 2020年12月10日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

相关基金

分数Brown运动驱动的随机微分方程随机分岔与遍历性的研究

国家自然科学基金

2+阅读 · 2015年12月31日

Markovian 跳变广义随机切换系统的稳定性及滑模控制与应用研究

国家自然科学基金

1+阅读 · 2014年12月31日

结构性稀疏信号的动态系统建模与恢复

国家自然科学基金

0+阅读 · 2013年12月31日

隶属度函数部分未知的T-S模糊系统有限频控制器设计

国家自然科学基金

0+阅读 · 2013年12月31日

不确定耦合PDE-ODE系统的自适应镇定

国家自然科学基金

0+阅读 · 2013年12月31日

随机跳跃系统的分析与综合

国家自然科学基金

0+阅读 · 2012年12月31日

广义受限系统的分析与优化设计

国家自然科学基金

0+阅读 · 2010年12月31日

具有SiISS逆动态的随机非线性系统的控制问题研究

国家自然科学基金

0+阅读 · 2009年12月31日

数字麦克风设计与噪声优化方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

时滞离散脉冲系统稳定、镇定与控制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员