数据效率模型学习框架,用于封闭液控制环流机器人机器人 (A Data-Efficient Model-Based Learning Framework for the Closed-Loop Control of Continuum Robots) - 专知论文

会员服务 ·

0

控制器 · 学成 · RNN · 机器人 · MoDELS ·

2022 年 4 月 22 日

A Data-Efficient Model-Based Learning Framework for the Closed-Loop Control of Continuum Robots

翻译：数据效率模型学习框架,用于封闭液控制环流机器人机器人

Xinran Wang,Nicolas Rojas

from arxiv, Accepted to the 2022 IEEE 5th International Conference on Soft Robotics (RoboSoft) on Feb 7th 2022

Traditional dynamic models of continuum robots are in general computationally expensive and not suitable for real-time control. Recent approaches using learning-based methods to approximate the dynamic model of continuum robots for control have been promising, although real data hungry -- which may cause potential damage to robots and be time consuming -- and getting poorer performance when trained with simulation data only. This paper presents a model-based learning framework for continuum robot closed-loop control that, by combining simulation and real data, shows to require only 100 real data to outperform a real-data-only controller trained using up to 10000 points. The introduced data-efficient framework with three control policies has utilized a Gaussian process regression (GPR) and a recurrent neural network (RNN). Control policy A uses a GPR model and a RNN trained in simulation to optimize control outputs for simulated targets; control policy B retrains the RNN in policy A with data generated from the GPR model to adapt to real robot physics; control policy C utilizes policy A and B to form a hybrid policy. Using a continuum robot with soft spines, we show that our approach provides an efficient framework to bridge the sim-to-real gap in model-based learning for continuum robots.

翻译：连续机器人的传统动态模型一般在计算上成本高昂,不适于实时控制。最近采用学习方法对连续机器人动态模型进行近似于连续机器人动态模型进行控制的方法是很有希望的,尽管实际数据饥饿 -- -- 可能对机器人造成潜在损害并耗费时间 -- -- 而且在仅接受模拟数据培训时性能更差。本文为连续机器人闭路控制提供了一个基于模型的学习框架,通过将模拟数据与真实数据相结合,显示只需要100个真实数据,才能超过一个经过训练的、使用最多10000点的实际数据控制器。引入了三个控制政策的数据效率框架,使用了高斯进程回归(GPR)和一个经常性神经网络(RNN)。控制政策A使用GPR模型和一个经过模拟培训的RNN,以优化模拟目标的控制产出;控制政策B在政策A中用GPR模型生成的数据重新输入RNN,以适应真实的机器人物理学;控制政策C利用政策A和B形成混合政策。使用软脊盖的连续机器人,我们展示了一种高效的框架,用以在模拟中连接Simto-to的机器人。

0

相关内容

控制器

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

磁各向异性催化剂制备及其磁稳定床加氢催化研究

国家自然科学基金

0+阅读 · 2014年12月31日

衰老小鼠线粒体促凋亡蛋白Omi/HtrA2表达增加在加重帕金森病中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ag3PO4/MoS2/TiO2纳米管新型复合电极的制备及其光电催化降解水中磺胺类抗生素的机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

面向人工心脏的多方位分布式无线传能机理与实验研究

国家自然科学基金

0+阅读 · 2013年12月31日

深海真菌抗植物病原真菌菌株的筛选及其活性物质研究

国家自然科学基金

0+阅读 · 2012年12月31日

目标运动突变和几何外观急剧变化的视觉跟踪

国家自然科学基金

0+阅读 · 2012年12月31日

特快速暂态过电压下特高压变压器绕组的建模算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属铝配位氢化物的储锂机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

民猪抗寒基因的筛选与鉴定

国家自然科学基金

0+阅读 · 2010年12月31日

Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?

Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?

Arxiv

0+阅读 · 2022年6月10日

Meta-Reinforcement Learning with Self-Modifying Networks

Meta-Reinforcement Learning with Self-Modifying Networks

Arxiv

1+阅读 · 2022年6月10日

Regret Bounds for Information-Directed Reinforcement Learning

Arxiv

0+阅读 · 2022年6月9日

A Study of Continual Learning Methods for Q-Learning

A Study of Continual Learning Methods for Q-Learning

Arxiv

0+阅读 · 2022年6月8日

Sim2real for Reinforcement Learning Driven Next Generation Networks

Arxiv

1+阅读 · 2022年6月8日

An Information-Theoretic Framework for Supervised Learning

Arxiv

0+阅读 · 2022年6月7日

MIX-MAB: Reinforcement Learning-based Resource Allocation Algorithm for LoRaWAN

Arxiv

0+阅读 · 2022年6月7日

Debiased Self-Training for Semi-Supervised Learning

Arxiv

0+阅读 · 2022年6月7日

Learning Treatment Plan Representations for Content Based Image Retrieval

Arxiv

0+阅读 · 2022年6月6日

Learning Discrete Structures for Graph Neural Networks

Arxiv

17+阅读 · 2019年3月28日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?

Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?

Arxiv

0+阅读 · 2022年6月10日

Meta-Reinforcement Learning with Self-Modifying Networks

Meta-Reinforcement Learning with Self-Modifying Networks

Arxiv

1+阅读 · 2022年6月10日

Regret Bounds for Information-Directed Reinforcement Learning

Arxiv

0+阅读 · 2022年6月9日

A Study of Continual Learning Methods for Q-Learning

A Study of Continual Learning Methods for Q-Learning

Arxiv

0+阅读 · 2022年6月8日

Sim2real for Reinforcement Learning Driven Next Generation Networks

Arxiv

1+阅读 · 2022年6月8日

An Information-Theoretic Framework for Supervised Learning

Arxiv

0+阅读 · 2022年6月7日

MIX-MAB: Reinforcement Learning-based Resource Allocation Algorithm for LoRaWAN

Arxiv

0+阅读 · 2022年6月7日

Debiased Self-Training for Semi-Supervised Learning

Arxiv

0+阅读 · 2022年6月7日

Learning Treatment Plan Representations for Content Based Image Retrieval

Arxiv

0+阅读 · 2022年6月6日

Learning Discrete Structures for Graph Neural Networks

Arxiv

17+阅读 · 2019年3月28日

相关基金

磁各向异性催化剂制备及其磁稳定床加氢催化研究

国家自然科学基金

0+阅读 · 2014年12月31日

衰老小鼠线粒体促凋亡蛋白Omi/HtrA2表达增加在加重帕金森病中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ag3PO4/MoS2/TiO2纳米管新型复合电极的制备及其光电催化降解水中磺胺类抗生素的机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

面向人工心脏的多方位分布式无线传能机理与实验研究

国家自然科学基金

0+阅读 · 2013年12月31日

深海真菌抗植物病原真菌菌株的筛选及其活性物质研究

国家自然科学基金

0+阅读 · 2012年12月31日

目标运动突变和几何外观急剧变化的视觉跟踪

国家自然科学基金

0+阅读 · 2012年12月31日

特快速暂态过电压下特高压变压器绕组的建模算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属铝配位氢化物的储锂机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

民猪抗寒基因的筛选与鉴定

国家自然科学基金

0+阅读 · 2010年12月31日

微信扫码咨询专知VIP会员