Accurate control of autonomous marine robots remains challenging due to the complex dynamics of the environment. In this paper, we propose a Deep Reinforcement Learning (DRL) approach to train a controller for autonomous surface vessel (ASV) trajectory tracking and compare its performance with that of an advanced nonlinear model predictive controller (NMPC) in real environments. Taking into account the environmental disturbances (e.g., wind, waves, and currents), noisy measurements, and non-ideal actuators present in the physical ASV, we carefully design several effective reward functions for the DRL tracking control policies. The control policies are trained in a simulation environment over diverse tracking trajectories and disturbances. The performance of the DRL controller is verified and compared with that of the NMPC both in simulations with model-based environmental disturbances and in natural waters. Simulations show that the DRL controller achieves 53.33% lower tracking error than the NMPC. Experimental results further show that, compared with the NMPC, the DRL controller achieves 35.51% lower tracking error, indicating that DRL controllers offer better disturbance rejection in river environments than NMPC.
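As an illustration of the kind of reward shaping described above, the minimal sketch below combines a position-error penalty, a heading-error penalty, and an actuator-smoothness penalty. The specific terms, weights, and function names are assumptions for illustration only, not the paper's exact reward functions.

```python
import numpy as np

def tracking_reward(pos, ref_pos, heading, ref_heading, action, prev_action,
                    w_e=1.0, w_psi=0.2, w_u=0.05):
    """Illustrative trajectory-tracking reward for an ASV control policy.

    Penalizes the distance to the reference point, the wrapped heading
    error, and abrupt changes in actuator commands. All weights and the
    exact form of each term are hypothetical.
    """
    pos_err = np.linalg.norm(np.asarray(pos) - np.asarray(ref_pos))
    # Wrap the heading error to [-pi, pi] before taking its magnitude.
    heading_err = np.abs(np.arctan2(np.sin(heading - ref_heading),
                                    np.cos(heading - ref_heading)))
    smoothness = np.linalg.norm(np.asarray(action) - np.asarray(prev_action))
    return -(w_e * pos_err + w_psi * heading_err + w_u * smoothness)
```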