争取对适应性线性线性控制有一个无尺寸限制的了解 (Towards a Dimension-Free Understanding of Adaptive Linear Control) - 专知论文

会员服务 ·

0

线性的 · 可理解性 · 控制器 · Performer · 估计/估计量 ·

2021 年 3 月 19 日

Towards a Dimension-Free Understanding of Adaptive Linear Control

翻译：争取对适应性线性线性控制有一个无尺寸限制的了解

Juan C. Perdomo,Max Simchowitz,Alekh Agarwal,Peter Bartlett

We study the problem of adaptive control of the linear quadratic regulator for systems in very high, or even infinite dimension. We demonstrate that while sublinear regret requires finite dimensional inputs, the ambient state dimension of the system need not be bounded in order to perform online control. We provide the first regret bounds for LQR which hold for infinite dimensional systems, replacing dependence on ambient dimension with more natural notions of problem complexity. Our guarantees arise from a novel perturbation bound for certainty equivalence which scales with the prediction error in estimating the system parameters, without requiring consistent parameter recovery in more stringent measures like the operator norm. When specialized to finite dimensional settings, our bounds recover near optimal dimension and time horizon dependence.

翻译：我们研究了线性二次调节器在高度或甚至无限维度系统中的适应性控制问题。我们证明,尽管亚线性遗憾需要有限的次线性输入,但系统的环境状态层面不需要为进行在线控制而受约束。我们为持有无限维度系统的LQR提供了第一个遗憾界限,用更自然的复杂问题概念取代对环境层面的依赖。我们的保障产生于一种新颖的扰动,约束于确定性等同,在估计系统参数时,以预测误差为尺度,而不需要在操作员规范等更为严格的措施下进行一致的参数恢复。当我们专门使用有限维度设置时,我们的界限恢复到接近最佳维度和时间范围依赖。

0

相关内容

线性的

【CMU】最新深度学习课程， Introduction to Deep Learning

【CMU】最新深度学习课程， Introduction to Deep Learning

专知会员服务

38+阅读 · 2020年9月12日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

常识知识图谱的零样本学习，布朗大学

专知会员服务

40+阅读 · 2020年6月19日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【IJCAI2020-华为诺亚】面向深度强化学习的策略迁移框架

【IJCAI2020-华为诺亚】面向深度强化学习的策略迁移框架

专知会员服务

28+阅读 · 2020年5月25日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

专知会员服务

15+阅读 · 2019年11月5日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

专知会员服务

85+阅读 · 2019年10月29日

已删除

将门创投

3+阅读 · 2020年8月3日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知

21+阅读 · 2020年5月30日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Pointwise-in-time a posteriori error control for time-fractional parabolic equations

Arxiv

0+阅读 · 2021年5月12日

Accuracy controlled data assimilation for parabolic problems

Arxiv

0+阅读 · 2021年5月12日

High-Dimensional Experimental Design and Kernel Bandits

High-Dimensional Experimental Design and Kernel Bandits

Arxiv

0+阅读 · 2021年5月12日

A rigorous introduction for linear models

Arxiv

0+阅读 · 2021年5月10日

Model-Assisted Uniformly Honest Inference for Optimal Treatment Regimes in High Dimension

Arxiv

0+阅读 · 2021年5月10日

Contrastive Embeddings for Neural Architectures

Arxiv

0+阅读 · 2021年5月7日

Semi-Exact Control Functionals From Sard's Method

Arxiv

0+阅读 · 2021年5月6日

A semigroup method for high dimensional committor functions based on neural network

Arxiv

0+阅读 · 2021年5月5日

Kernelization of Maximum Minimal Vertex Cover

Arxiv

0+阅读 · 2021年5月4日

Towards Understanding Regularization in Batch Normalization

Towards Understanding Regularization in Batch Normalization

Arxiv

4+阅读 · 2018年9月27日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【CMU】最新深度学习课程， Introduction to Deep Learning

【CMU】最新深度学习课程， Introduction to Deep Learning

专知会员服务

38+阅读 · 2020年9月12日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

常识知识图谱的零样本学习，布朗大学

专知会员服务

40+阅读 · 2020年6月19日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【IJCAI2020-华为诺亚】面向深度强化学习的策略迁移框架

【IJCAI2020-华为诺亚】面向深度强化学习的策略迁移框架

专知会员服务

28+阅读 · 2020年5月25日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

专知会员服务

15+阅读 · 2019年11月5日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

专知会员服务

85+阅读 · 2019年10月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

《生态建模密码破译：建模与编程实践》美陆军最新报告

大模型解决方案白皮书：社交陪伴场景全流程落地指南

面向具身操作的视觉-语言-动作模型综述

相关资讯

已删除

将门创投

3+阅读 · 2020年8月3日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知

21+阅读 · 2020年5月30日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Pointwise-in-time a posteriori error control for time-fractional parabolic equations

Arxiv

0+阅读 · 2021年5月12日

Accuracy controlled data assimilation for parabolic problems

Arxiv

0+阅读 · 2021年5月12日

High-Dimensional Experimental Design and Kernel Bandits

High-Dimensional Experimental Design and Kernel Bandits

Arxiv

0+阅读 · 2021年5月12日

A rigorous introduction for linear models

Arxiv

0+阅读 · 2021年5月10日

Model-Assisted Uniformly Honest Inference for Optimal Treatment Regimes in High Dimension

Arxiv

0+阅读 · 2021年5月10日

Contrastive Embeddings for Neural Architectures

Arxiv

0+阅读 · 2021年5月7日

Semi-Exact Control Functionals From Sard's Method

Arxiv

0+阅读 · 2021年5月6日

A semigroup method for high dimensional committor functions based on neural network

Arxiv

0+阅读 · 2021年5月5日

Kernelization of Maximum Minimal Vertex Cover

Arxiv

0+阅读 · 2021年5月4日

Towards Understanding Regularization in Batch Normalization

Towards Understanding Regularization in Batch Normalization

Arxiv

4+阅读 · 2018年9月27日

微信扫码咨询专知VIP会员