Attitude control of fixed-wing unmanned aerial vehicles (UAVs) is a difficult control problem, in part due to uncertain nonlinear dynamics, actuator constraints, and coupled longitudinal and lateral motions. Current state-of-the-art autopilots are based on linear control and are thus limited in their effectiveness and performance. Deep reinforcement learning (DRL) is a machine learning method that automatically discovers an optimal control law through interaction with the controlled system and can handle complex nonlinear dynamics. We show in this paper that DRL can successfully learn to perform attitude control of a fixed-wing UAV operating directly on the original nonlinear dynamics, requiring as little as three minutes of flight data. We initially train our model in a simulation environment and then deploy the learned controller on the UAV in flight tests, demonstrating performance comparable to the state-of-the-art ArduPlane proportional-integral-derivative (PID) attitude controller with no further online learning required. Learning with a significant actuation delay and diversified simulated dynamics was found to be crucial for successful transfer to control of the real UAV. In addition to a qualitative comparison with the ArduPlane autopilot, we present a quantitative assessment based on linear analysis to better understand the learned controller's behavior.
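The two sim-to-real measures highlighted above (training with actuation delay and with diversified simulated dynamics) can be illustrated with a minimal environment sketch. This is not the paper's simulator; the class name, dynamics model, and parameter ranges are illustrative assumptions only.

```python
import random
from collections import deque

class DelayedRandomizedEnv:
    """Toy 1-DOF attitude environment sketching two sim-to-real measures:
    (1) actuation delay: commands pass through a FIFO before taking effect;
    (2) diversified dynamics: inertia and damping are resampled per episode.
    All names and numeric ranges are illustrative, not the paper's values."""

    def __init__(self, delay_steps=3, dt=0.02):
        self.delay_steps = delay_steps
        self.dt = dt
        self.reset()

    def reset(self):
        # Diversified dynamics: resample uncertain parameters each episode.
        self.inertia = random.uniform(0.8, 1.2)
        self.damping = random.uniform(0.05, 0.15)
        self.angle, self.rate = 0.0, 0.0
        # Actuation delay: pre-fill the queue with neutral commands.
        self.pending = deque([0.0] * self.delay_steps)
        return (self.angle, self.rate)

    def step(self, action):
        self.pending.append(action)
        applied = self.pending.popleft()  # delayed command reaches the actuator
        accel = (applied - self.damping * self.rate) / self.inertia
        self.rate += accel * self.dt
        self.angle += self.rate * self.dt
        reward = -abs(self.angle)         # penalize attitude error from zero
        return (self.angle, self.rate), reward
```

A policy trained against such an environment must learn to act despite its commands only taking effect `delay_steps` control periods later, and must remain robust across the sampled dynamics, which is the mechanism the abstract credits for successful transfer.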