RAT iLQR: 风险自动自动调试控制器到存储模型错配最佳账户 (RAT iLQR: A Risk Auto-Tuning Controller to Optimally Account for Stochastic Model Mismatch) - 专知论文

会员服务 ·

0

控制器 · 优化器 · 稳健性 · MoDELS · 散度 ·

2021 年 1 月 18 日

RAT iLQR: A Risk Auto-Tuning Controller to Optimally Account for Stochastic Model Mismatch

翻译：RAT iLQR: 风险自动自动调试控制器到存储模型错配最佳账户

Haruki Nishimura,Negar Mehr,Adrien Gaidon,Mac Schwager

from arxiv, To appear in IEEE Robotics and Automation Letters

Successful robotic operation in stochastic environments relies on accurate characterization of the underlying probability distributions, yet this is often imperfect due to limited knowledge. This work presents a control algorithm that is capable of handling such distributional mismatches. Specifically, we propose a novel nonlinear MPC for distributionally robust control, which plans locally optimal feedback policies against a worst-case distribution within a given KL divergence bound from a Gaussian distribution. Leveraging mathematical equivalence between distributionally robust control and risk-sensitive optimal control, our framework also provides an algorithm to dynamically adjust the risk-sensitivity level online for risk-sensitive control. The benefits of the distributional robustness as well as the automatic risk-sensitivity adjustment are demonstrated in a dynamic collision avoidance scenario where the predictive distribution of human motion is erroneous.

翻译：在随机环境中成功的机器人操作依赖于对潜在概率分布的准确描述,但由于知识有限,这往往不完美。这项工作提出了一个控制算法,能够处理这种分布不匹配。具体地说,我们提议采用一种新的非线性MPC来进行分配稳健控制,根据与高斯分布相约束的某个特定 KL 差异中最坏的分布情况,规划当地最佳反馈政策。利用分布稳健的控制和风险敏感度最佳控制之间的数学等值,我们的框架还提供一种算法来动态调整在线风险敏感度,以进行风险敏感控制。分布稳健性的好处以及自动风险敏感度调整在动态避免碰撞的假设中表现出来,在这种假设中,人类运动的预测分布是错误的。

0

相关内容

控制器

【Google】梯度下降，48页ppt

【Google】梯度下降，48页ppt

专知会员服务

81+阅读 · 2020年12月5日

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

专知会员服务

48+阅读 · 2020年7月5日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

252+阅读 · 2020年4月19日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

泡泡机器人SLAM

3+阅读 · 2018年10月26日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

基于LDA的主题模型实践（一）

基于LDA的主题模型实践（一）

机器学习深度学习实战原创交流

20+阅读 · 2015年9月9日

Learning with tree tensor networks: complexity estimates and model selection

Arxiv

0+阅读 · 2021年3月11日

A Deamortization Approach for Dynamic Spanner and Dynamic Maximal Matching

Arxiv

0+阅读 · 2021年3月11日

A One-step Approach to Covariate Shift Adaptation

Arxiv

0+阅读 · 2021年3月11日

Advancing Trajectory Optimization with Approximate Inference: Exploration, Covariance Control and Adaptive Risk

Arxiv

0+阅读 · 2021年3月10日

Entropy-regularized optimal transport on multivariate normal and q-normal distributions

Arxiv

0+阅读 · 2021年3月10日

Control and Trajectory Optimization for Soft Aerial Manipulation

Arxiv

0+阅读 · 2021年3月10日

Entropy-Guided Control Improvisation

Arxiv

0+阅读 · 2021年3月9日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

Learning to Adapt: Meta-Learning for Model-Based Control

Arxiv

9+阅读 · 2018年3月30日

VIP会员

文章信息

相关主题

相关VIP内容

【Google】梯度下降，48页ppt

【Google】梯度下降，48页ppt

专知会员服务

81+阅读 · 2020年12月5日

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

专知会员服务

48+阅读 · 2020年7月5日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

252+阅读 · 2020年4月19日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维与高维空间中对潜在表征的分析、建模与变换

《美军使用大语言模型技术生成领域特定文档》2025最新379页

【NeurIPS 2025】以语言为中心的全模态表征学习的可扩展性研究

智能体化多模态大语言模型综述

相关资讯

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

【泡泡一分钟】ProbFlow:联合光流和不确定性估计

泡泡机器人SLAM

3+阅读 · 2018年10月26日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

基于LDA的主题模型实践（一）

基于LDA的主题模型实践（一）

机器学习深度学习实战原创交流

20+阅读 · 2015年9月9日

相关论文

Learning with tree tensor networks: complexity estimates and model selection

Arxiv

0+阅读 · 2021年3月11日

A Deamortization Approach for Dynamic Spanner and Dynamic Maximal Matching

Arxiv

0+阅读 · 2021年3月11日

A One-step Approach to Covariate Shift Adaptation

Arxiv

0+阅读 · 2021年3月11日

Advancing Trajectory Optimization with Approximate Inference: Exploration, Covariance Control and Adaptive Risk

Arxiv

0+阅读 · 2021年3月10日

Entropy-regularized optimal transport on multivariate normal and q-normal distributions

Arxiv

0+阅读 · 2021年3月10日

Control and Trajectory Optimization for Soft Aerial Manipulation

Arxiv

0+阅读 · 2021年3月10日

Entropy-Guided Control Improvisation

Arxiv

0+阅读 · 2021年3月9日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

Learning to Adapt: Meta-Learning for Model-Based Control

Arxiv

9+阅读 · 2018年3月30日

微信扫码咨询专知VIP会员