数据驱动数据驱动分配, 强力优化控制, 带有国家依赖的噪音</s> (Data-Driven Distributionally Robust Optimal Control with State-Dependent Noise) - 专知论文

会员服务 ·

0

噪声分布 · 噪声 · 稳健性 · 控制器 · 优化器 ·

2023 年 3 月 4 日

Data-Driven Distributionally Robust Optimal Control with State-Dependent Noise

翻译：数据驱动数据驱动分配, 强力优化控制, 带有国家依赖的噪音

Rui Liu,Guangyao Shi,Pratap Tokekar

This paper introduces innovative data-driven techniques for estimating the noise distribution and KL divergence bound for distributionally robust optimal control (DROC). The proposed approach addresses the limitation of traditional DROC approaches that require known ambiguity sets for the noise distribution, our approach can learn these distributions and bounds in real-world scenarios where they may not be known a priori. To evaluate the effectiveness of our approach, a navigation problem involving a car-like robot under different noise distributions is used as a numerical example. The results demonstrate that DROC combined with the proposed data-driven approaches, what we call D3ROC, provide robust and efficient control policies that outperform the traditional iterative linear quadratic Gaussian (iLQG) control approach. Moreover, it shows the effectiveness of our proposed approach in handling different noise distributions. Overall, the proposed approach offers a promising solution to real-world DROC problems where the noise distribution and KL divergence bounds may not be known a priori, increasing the practicality and applicability of the DROC framework.

翻译：本文介绍了用于估计噪音分布和KL差分以进行分配稳健最佳控制的创新数据驱动技术(DROC)。拟议办法处理传统的DROC方法的局限性,这些方法需要已知的噪音分布模棱两可,我们的方法可以在现实情景中了解这些分布和界限,在现实情景中可能无法事先知道这些分布和界限。为了评估我们的方法的有效性,不同噪音分布下涉及汽车类机器人的导航问题被用作数字例子。结果显示DROC与拟议的数据驱动方法(我们称之为D3ROC)相结合,提供了强有力和高效的控制政策,超过了传统的迭代线性象形高斯(iLQG)控制方法。此外,拟议办法显示了我们处理不同噪音分布的拟议方法的有效性。总体而言,拟议办法为现实世界DROC问题提供了一个很有希望的解决办法,因为噪音分布和KL差幅界限可能事先不为人所知,从而增加了DROC框架的实用性和适用性。</s>

0

相关内容

噪声分布

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

大神一年100篇论文

大神一年100篇论文

CreateAMind

15+阅读 · 2018年12月31日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

多倍体西瓜枯萎病抗性DNA甲基化调控机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

稠密轻质物质中原子核量子交换效应的分子动力学研究

国家自然科学基金

0+阅读 · 2015年12月31日

无尾飞翼布局飞行器的操纵面故障强化学习最优自适应补偿控制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

杏仁核-海马CA1区-前额叶皮层环路异常在抑郁发生中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

定量动态增强MRI评估骨质疏松骨血流灌注降低及其机制

国家自然科学基金

1+阅读 · 2012年12月31日

白藜芦醇调节STIM1抑制血管平滑肌细胞增殖机制的探讨

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程的三角形cut-cell自适应有限元方法

国家自然科学基金

0+阅读 · 2011年12月31日

内质网应激在视网膜色素变性中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

EPO抑制创伤性脑水肿的分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

Arxiv

0+阅读 · 2023年4月26日

Data-driven reduced order models using invariant foliations, manifolds and autoencoders

Arxiv

0+阅读 · 2023年4月26日

Generating Adversarial Examples with Task Oriented Multi-Objective Optimization

Arxiv

0+阅读 · 2023年4月26日

Differential Privacy via Distributionally Robust Optimization

Arxiv

0+阅读 · 2023年4月25日

Causal Effect Estimation with Variational AutoEncoder and the Front Door Criterion

Arxiv

0+阅读 · 2023年4月24日

Approximate Order-Preserving Pattern Mining for Time Series

Arxiv

0+阅读 · 2023年4月23日

A Data-Driven Approach for Bayesian Uncertainty Quantification in Imaging

Arxiv

0+阅读 · 2023年4月21日

Under-Approximate Reachability Analysis for a Class of Linear Systems with Inputs

Arxiv

0+阅读 · 2023年4月20日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《分析与预测陆军战斗体能测试表现：统计与机器学习方法》2025最新137页

《军事行动中的人机协同共同学习》2025最新文献

代理式人工智能时代的决策优势

《F/A-18机队替换中队仿真模型的设计与分析》2025最新73页

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

大神一年100篇论文

大神一年100篇论文

CreateAMind

15+阅读 · 2018年12月31日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

Arxiv

0+阅读 · 2023年4月26日

Data-driven reduced order models using invariant foliations, manifolds and autoencoders

Arxiv

0+阅读 · 2023年4月26日

Generating Adversarial Examples with Task Oriented Multi-Objective Optimization

Arxiv

0+阅读 · 2023年4月26日

Differential Privacy via Distributionally Robust Optimization

Arxiv

0+阅读 · 2023年4月25日

Causal Effect Estimation with Variational AutoEncoder and the Front Door Criterion

Arxiv

0+阅读 · 2023年4月24日

Approximate Order-Preserving Pattern Mining for Time Series

Arxiv

0+阅读 · 2023年4月23日

A Data-Driven Approach for Bayesian Uncertainty Quantification in Imaging

Arxiv

0+阅读 · 2023年4月21日

Under-Approximate Reachability Analysis for a Class of Linear Systems with Inputs

Arxiv

0+阅读 · 2023年4月20日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

多倍体西瓜枯萎病抗性DNA甲基化调控机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

稠密轻质物质中原子核量子交换效应的分子动力学研究

国家自然科学基金

0+阅读 · 2015年12月31日

无尾飞翼布局飞行器的操纵面故障强化学习最优自适应补偿控制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

杏仁核-海马CA1区-前额叶皮层环路异常在抑郁发生中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

定量动态增强MRI评估骨质疏松骨血流灌注降低及其机制

国家自然科学基金

1+阅读 · 2012年12月31日

白藜芦醇调节STIM1抑制血管平滑肌细胞增殖机制的探讨

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程的三角形cut-cell自适应有限元方法

国家自然科学基金

0+阅读 · 2011年12月31日

内质网应激在视网膜色素变性中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

EPO抑制创伤性脑水肿的分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员