Adaptive Risk Sensitive Model Predictive Control with Stochastic Search - Zhuanzhi (专知) Papers

Topics: Optimizers · Controllers · MoDELS · Dynamical systems · Reinforcement learning

February 12, 2021

Adaptive Risk Sensitive Model Predictive Control with Stochastic Search


Ziyi Wang, Oswin So, Keuntaek Lee, Camilo A. Duarte, Evangelos A. Theodorou

We present a general framework for optimizing the Conditional Value-at-Risk (CVaR) of dynamical systems using stochastic search. The framework handles uncertainty arising from the initial condition, stochastic dynamics, and uncertain model parameters. We compare the algorithm against a risk-sensitive distributional reinforcement learning framework and show that it performs better on a pendulum and a cartpole with stochastic dynamics. We also showcase the applicability of the framework to robotics as an adaptive risk-sensitive controller: in simulation, it optimizes with respect to the fully nonlinear belief provided by a particle filter on a pendulum, a cartpole, and a quadcopter.
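To make the idea of optimizing Conditional Value-at-Risk with stochastic search concrete, here is a minimal Python sketch. It is not the authors' algorithm: the one-step scalar system, the quadratic cost, the exponentiated-weight update, and every name and parameter (`toy_cost`, `stochastic_search_cvar`, `temperature`, the noise scale 0.3) are assumptions chosen for illustration. The sketch samples candidate controls from a Gaussian, scores each by the empirical CVaR of its cost over sampled disturbances, and refits the Gaussian toward the low-CVaR candidates.

```python
import numpy as np


def empirical_cvar(costs, alpha):
    """Mean of the worst (1 - alpha) fraction of the sampled costs."""
    costs = np.sort(np.asarray(costs, dtype=float))
    k = max(1, int(np.ceil((1.0 - alpha) * costs.size)))
    return float(costs[-k:].mean())


def toy_cost(u, w):
    # Hypothetical one-step system: x_next = x0 + u + w with x0 = 1.
    # The cost is the squared next state, so u = -1 cancels the initial state.
    return (1.0 + u + w) ** 2


def stochastic_search_cvar(alpha=0.9, iters=30, n_candidates=64,
                           n_noise=256, temperature=0.1, seed=0):
    """Gaussian sampling-based search over a scalar control (illustrative only)."""
    rng = np.random.default_rng(seed)
    mu, sigma = 0.0, 1.0  # search distribution over the control u
    for _ in range(iters):
        candidates = rng.normal(mu, sigma, size=n_candidates)
        # Shared disturbance samples make the candidate scores comparable.
        noise = rng.normal(0.0, 0.3, size=n_noise)
        scores = np.array([empirical_cvar(toy_cost(u, noise), alpha)
                           for u in candidates])
        # Exponentiated weights favour candidates with a low CVaR.
        weights = np.exp(-(scores - scores.min()) / temperature)
        weights /= weights.sum()
        mu = float(weights @ candidates)
        # Small floor keeps the search distribution from collapsing to a point.
        sigma = float(np.sqrt(weights @ (candidates - mu) ** 2)) + 1e-3
    return mu


if __name__ == "__main__":
    print("control found by the search:", stochastic_search_cvar())
```

Because the disturbance in this toy problem is zero-mean and symmetric, the CVaR-optimal control is u = -1, and the search mean should settle near that value. In a model predictive control setting, the scalar `u` would be replaced by a control sequence over the horizon and `toy_cost` by a rollout of the system dynamics.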

