自主气球控制：基于H-无穷鲁棒深度残差强化学习的方法 (Autonomous Blimp Control via H-infinity Robust Deep Residual Reinforcement Learning) - 专知论文

会员服务 ·

0

深度残差强化学习 · 鲁棒 · 控制器 · PID · 强化学习 ·

2023 年 3 月 24 日

Autonomous Blimp Control via H-infinity Robust Deep Residual Reinforcement Learning

翻译：自主气球控制：基于H-无穷鲁棒深度残差强化学习的方法

Yang Zuo,Yu Tang Liu,Aamir Ahmad

Due to their superior energy efficiency, blimps may replace quadcopters for long-duration aerial tasks. However, designing a controller for blimps to handle complex dynamics, modeling errors, and disturbances remains an unsolved challenge. One recent work combines reinforcement learning (RL) and a PID controller to address this challenge and demonstrates its effectiveness in real-world experiments. In the current work, we build on that using an H-infinity robust controller to expand the stability margin and improve the RL agent's performance. Empirical analysis of different mixing methods reveals that the resulting H-infinity-RL controller outperforms the prior PID-RL combination and can handle more complex tasks involving intensive thrust vectoring. We provide our code as open-source at https://github.com/robot-perception-group/robust_deep_residual_blimp.

翻译：由于其更高的能源效率，气球可以取代四旋翼飞行器执行长时间的空中任务。然而，设计能应对复杂动力学、建模误差和干扰的控制器仍然是一个尚未解决的挑战。最近的研究结合强化学习（RL）和PID控制器来解决这个挑战，并在实际实验中证明了其有效性。在当前工作中，我们基于此，利用H-无穷鲁棒控制器来扩大稳定边界并提高RL智能体的性能。关于不同混合方法的经验分析表明，结果证明基于H-无穷鲁棒深度残差强化学习的控制器优于之前的PID/RL组合，并且可以处理涉及积极推力矢量的更复杂任务。我们将我们的代码作为开源提供在 https://github.com/robot-perception-group/robust_deep_residual_blimp。

0

相关内容

深度残差强化学习

深度残差强化学习

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

专知会员服务

79+阅读 · 2022年12月11日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【AdaMod】一个新的深度学习优化与记忆（Meet AdaMod: a new deep learning optimizer with memory）

【AdaMod】一个新的深度学习优化与记忆（Meet AdaMod: a new deep learning optimizer with memory）

专知会员服务

15+阅读 · 2020年1月13日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

23+阅读 · 2015年12月31日

无尾飞翼布局飞行器的操纵面故障强化学习最优自适应补偿控制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

基于智能在线虚拟参考反馈整定的控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于融合智能算法斜拉桥振动控制Benchmark问题的混合控制策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向复杂流场的欠驱动AUV路径跟踪控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于近似动态规划的非线性系统鲁棒优化控制

国家自然科学基金

2+阅读 · 2012年12月31日

外部干扰下自主式水下机器人推进器与导航传感器故障诊断方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于时空梯度耦合虚拟目标的欠驱动AUV航迹控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

汽车复杂约束下的多目标集成控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research

Arxiv

0+阅读 · 2023年5月16日

Spatial-temporal recurrent reinforcement learning for autonomous ships

Arxiv

0+阅读 · 2023年5月15日

Deep RL with Hierarchical Action Exploration for Dialogue Generation

Arxiv

0+阅读 · 2023年5月15日

On the Reuse Bias in Off-Policy Reinforcement Learning

Arxiv

0+阅读 · 2023年5月15日

State-wise Safe Reinforcement Learning: A Survey

Arxiv

0+阅读 · 2023年5月13日

S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning

Arxiv

0+阅读 · 2023年5月12日

Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms

Arxiv

0+阅读 · 2023年5月12日

Deep Reinforcement Learning for Interference Management in UAV-based 3D Networks: Potentials and Challenges

Arxiv

0+阅读 · 2023年5月11日

Optimizing Memory Mapping Using Deep Reinforcement Learning

Arxiv

0+阅读 · 2023年5月11日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

深度残差强化学习

相关VIP内容

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

专知会员服务

79+阅读 · 2022年12月11日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【AdaMod】一个新的深度学习优化与记忆（Meet AdaMod: a new deep learning optimizer with memory）

【AdaMod】一个新的深度学习优化与记忆（Meet AdaMod: a new deep learning optimizer with memory）

专知会员服务

15+阅读 · 2020年1月13日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research

Arxiv

0+阅读 · 2023年5月16日

Spatial-temporal recurrent reinforcement learning for autonomous ships

Arxiv

0+阅读 · 2023年5月15日

Deep RL with Hierarchical Action Exploration for Dialogue Generation

Arxiv

0+阅读 · 2023年5月15日

On the Reuse Bias in Off-Policy Reinforcement Learning

Arxiv

0+阅读 · 2023年5月15日

State-wise Safe Reinforcement Learning: A Survey

Arxiv

0+阅读 · 2023年5月13日

S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning

Arxiv

0+阅读 · 2023年5月12日

Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms

Arxiv

0+阅读 · 2023年5月12日

Deep Reinforcement Learning for Interference Management in UAV-based 3D Networks: Potentials and Challenges

Arxiv

0+阅读 · 2023年5月11日

Optimizing Memory Mapping Using Deep Reinforcement Learning

Arxiv

0+阅读 · 2023年5月11日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

相关基金

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

23+阅读 · 2015年12月31日

无尾飞翼布局飞行器的操纵面故障强化学习最优自适应补偿控制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

基于智能在线虚拟参考反馈整定的控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于融合智能算法斜拉桥振动控制Benchmark问题的混合控制策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向复杂流场的欠驱动AUV路径跟踪控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于近似动态规划的非线性系统鲁棒优化控制

国家自然科学基金

2+阅读 · 2012年12月31日

外部干扰下自主式水下机器人推进器与导航传感器故障诊断方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于时空梯度耦合虚拟目标的欠驱动AUV航迹控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

汽车复杂约束下的多目标集成控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员