Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning - Zhuanzhi (专知) Papers

Controllers · Brackets · Reinforcement Learning · Engineering

January 12, 2021

Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning

Nathan Lambert, Craig Schindler, Daniel Drew, Kristofer Pister

From arXiv; 7 pages, 1-page appendix

Nonholonomic control is a candidate to control nonlinear systems with path-dependent states. We investigate an underactuated flying micro-aerial-vehicle, the ionocraft, that requires nonholonomic control in the yaw-direction for complete attitude control. Deploying an analytical control law involves substantial engineering design and is sensitive to inaccuracy in the system model. With specific assumptions on assembly and system dynamics, we derive a Lie bracket for yaw control of the ionocraft. As a comparison to the significant engineering effort required for an analytic control law, we implement a data-driven model-based reinforcement learning yaw controller in a simulated flight task. We demonstrate that a simple model-based reinforcement learning framework can match the derived Lie bracket control (in yaw rate and chosen actions) in a few minutes of flight data, without a pre-defined dynamics function. This paper shows that learning-based approaches are useful as a tool for synthesis of nonlinear control laws previously only addressable through expert-based design.
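The Lie bracket mentioned in the abstract can be illustrated on a simpler system. The sketch below is not the ionocraft derivation, since the abstract gives no dynamics. It assumes the textbook unicycle model with state (x, y, theta), two hypothetical input vector fields named `drive` and `steer`, and the convention [f, g](x) = Dg(x) f(x) - Df(x) g(x), with Jacobians approximated by central finite differences. It shows how the bracket of two actuated directions can yield motion along a direction that neither input actuates directly.

```python
import math


def jacobian(field, x, eps=1e-6):
    """Central finite-difference Jacobian J[i][j] = d field_i / d x_j."""
    n = len(x)
    columns = []
    for j in range(n):
        x_plus = list(x)
        x_minus = list(x)
        x_plus[j] += eps
        x_minus[j] -= eps
        f_plus = field(x_plus)
        f_minus = field(x_minus)
        columns.append([(a - b) / (2 * eps) for a, b in zip(f_plus, f_minus)])
    # columns[j][i] holds d field_i / d x_j, so transpose into row-major form.
    return [[columns[j][i] for j in range(n)] for i in range(n)]


def matvec(matrix, vector):
    """Matrix-vector product for plain nested lists."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def lie_bracket(f, g, x):
    """Assumed convention: [f, g](x) = Dg(x) f(x) - Df(x) g(x)."""
    dg_f = matvec(jacobian(g, x), f(x))
    df_g = matvec(jacobian(f, x), g(x))
    return [a - b for a, b in zip(dg_f, df_g)]


def drive(state):
    """Hypothetical input field: move forward along the current heading."""
    _, _, theta = state
    return [math.cos(theta), math.sin(theta), 0.0]


def steer(state):
    """Hypothetical input field: rotate in place."""
    return [0.0, 0.0, 1.0]


# Bracket of the two inputs evaluated at the origin with heading theta = 0.
bracket_at_origin = lie_bracket(drive, steer, [0.0, 0.0, 0.0])
print(bracket_at_origin)
```

At theta = 0 the bracket is approximately (0, -1, 0), a sideways direction that neither driving forward nor turning in place produces on its own. This is the same kind of mechanism the abstract appeals to for yaw, although the vector fields for the ionocraft would differ.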

Related content

Related VIP content

[DeepMind] Model-Based Reinforcement Learning, 174-page slide deck

Zhuanzhi Member Service

89+ reads · December 1, 2021 is not stated; listed date: January 12, 2021

Latest report on Federated Learning

Zhuanzhi Member Service

89+ reads · December 2, 2020

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Zhuanzhi Member Service

41+ reads · April 11, 2020

[Recommended tutorial] PDE Based Trustworthy Deep Learning, 44-page slide deck

Zhuanzhi Member Service

37+ reads · March 14, 2020

[Survey] Threats to Federated Learning: A Survey

Zhuanzhi Member Service

80+ reads · March 4, 2020

Deep reinforcement learning policy gradient tutorial, 53-page slide deck

Zhuanzhi Member Service

184+ reads · February 1, 2020

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Zhuanzhi Member Service

59+ reads · October 17, 2019

Keras, François Chollet, "Deep Learning with Python", 386-page PDF

Zhuanzhi Member Service

160+ reads · October 12, 2019

Latest reinforcement learning tutorial, 17-page PDF

Zhuanzhi Member Service

182+ reads · October 11, 2019

[Reinforcement Learning Workshop | Microsoft Research] Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare, by Finale Doshi-Velez, Assistant Professor at Harvard University

Zhuanzhi Member Service

7+ reads · October 3, 2019

Related news

Transferring Knowledge across Learning Processes

CreateAMind

29+ reads · May 18, 2019

Inverse reinforcement learning: the motivation for learning human priors

CreateAMind

16+ reads · January 18, 2019

Unsupervised Meta-Learning for Reinforcement Learning

CreateAMind

18+ reads · January 7, 2019

Unsupervised Learning via Meta-Learning

CreateAMind

43+ reads · January 3, 2019

Meta learning in 2017: MAML, SNAIL

CreateAMind

11+ reads · January 2, 2019

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+ reads · May 25, 2018

Reinforcement Learning: An Introduction, 2018 second edition, 500 pages

CreateAMind

14+ reads · April 27, 2018

A family tree of reinforcement learning

CreateAMind

26+ reads · August 2, 2017

Reinforcement learning: cartpole_a3c

CreateAMind

9+ reads · July 21, 2017

Andrew Ng's new book "Machine Learning Yearning"

我爱机器学习 (I Love Machine Learning)

11+ reads · December 7, 2016

Related papers

Bilateral Control-Based Imitation Learning for Velocity-Controlled Robot

arXiv

0+ reads · March 9, 2021

A model-based framework for learning transparent swarm behaviors

arXiv

0+ reads · March 9, 2021

Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning

arXiv

0+ reads · March 9, 2021

Safe Active Dynamics Learning and Control: A Sequential Exploration-Exploitation Framework

arXiv

0+ reads · March 8, 2021

Off-Belief Learning

arXiv

0+ reads · March 6, 2021

MAMBPO: Sample-efficient multi-robot reinforcement learning using learned world models

arXiv

0+ reads · March 5, 2021

Learning to Walk via Deep Reinforcement Learning

arXiv

7+ reads · December 26, 2018

An Introduction to Deep Reinforcement Learning

arXiv

4+ reads · December 3, 2018

A survey on policy search algorithms for learning robot controllers in a handful of trials

arXiv

3+ reads · July 6, 2018

Learning to Adapt: Meta-Learning for Model-Based Control

arXiv

9+ reads · March 30, 2018
