多阶段多目的、多目的、多目的、多目的、有适应性等级课程的操纵 (Multi-Phase Multi-Objective Dexterous Manipulation with Adaptive Hierarchical Curriculum) - 专知论文

会员服务 ·

0

Learning · Agent · 机器人 · Performer · 优化器 ·

2022 年 7 月 29 日

Multi-Phase Multi-Objective Dexterous Manipulation with Adaptive Hierarchical Curriculum

翻译：多阶段多目的、多目的、多目的、多目的、有适应性等级课程的操纵

Lingfeng Tao,Jiucai Zhang,Xiaoli Zhang

from arxiv, Accepted by the Journal of Intelligent & Robotic Systems

Dexterous manipulation tasks usually have multiple objectives, and the priorities of these objectives may vary at different phases of a manipulation task. Varying priority makes a robot hardly or even failed to learn an optimal policy with a deep reinforcement learning (DRL) method. To solve this problem, we develop a novel Adaptive Hierarchical Reward Mechanism (AHRM) to guide the DRL agent to learn manipulation tasks with multiple prioritized objectives. The AHRM can determine the objective priorities during the learning process and update the reward hierarchy to adapt to the changing objective priorities at different phases. The proposed method is validated in a multi-objective manipulation task with a JACO robot arm in which the robot needs to manipulate a target with obstacles surrounded. The simulation and physical experiment results show that the proposed method improved robot learning in task performance and learning efficiency.

翻译：不相干操纵任务通常具有多重目标,而且这些目标的优先事项在操纵任务的不同阶段可能各不相同。不同的优先级使得机器人几乎或甚至无法学习一种最佳政策,采用深层强化学习(DRL)方法。为了解决这个问题,我们开发了一个新型的适应性等级奖励机制(AHRM)来指导DRL代理机构学习具有多重优先目标的操纵任务。 AHRM可以确定学习过程中的客观优先事项,并更新奖励等级,以适应不同阶段不断变化的目标优先事项。拟议的方法在多目标操作任务中被验证, 由JACO机器人臂操作, 机器人需要用环绕障碍来操纵目标。模拟和物理实验结果显示, 拟议的方法可以改进机器人在任务性能和学习效率方面的学习。

0

相关内容

Learning

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

雷公藤甲素诱导急性早幼粒白血病细胞凋亡及自噬的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

从调控星形胶质细胞活化异质性探讨益肾化浊通络法对多发性硬化髓鞘再生适应性保护效应机制

国家自然科学基金

0+阅读 · 2013年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

油菜BnICE1基因与MAP激酶信号途径在调控植物耐寒性中的相互作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

Hierarchical Adaptive Loco-manipulation Control for Quadruped Robots

Arxiv

0+阅读 · 2022年9月27日

Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics Scenario

Arxiv

0+阅读 · 2022年9月25日

Barrier functions enable safety-conscious force-feedback control

Arxiv

0+阅读 · 2022年9月25日

Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training

Arxiv

0+阅读 · 2022年9月24日

Planning for Multi-Object Manipulation with Graph Neural Network Relational Classifiers

Arxiv

0+阅读 · 2022年9月24日

VIP会员

文章信息

相关主题

相关VIP内容

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的事件抽取：方法、模态与未来展望的全面综述

美海军作战管理系统：变革战场空间的二十年

【MIT博士论文】以语言为中心的医学影像理解

俄罗斯“沙希德”/“天竺葵”攻击无人机

相关资讯

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Hierarchical Adaptive Loco-manipulation Control for Quadruped Robots

Arxiv

0+阅读 · 2022年9月27日

Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics Scenario

Arxiv

0+阅读 · 2022年9月25日

Barrier functions enable safety-conscious force-feedback control

Arxiv

0+阅读 · 2022年9月25日

Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training

Arxiv

0+阅读 · 2022年9月24日

Planning for Multi-Object Manipulation with Graph Neural Network Relational Classifiers

Arxiv

0+阅读 · 2022年9月24日

相关基金

雷公藤甲素诱导急性早幼粒白血病细胞凋亡及自噬的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

从调控星形胶质细胞活化异质性探讨益肾化浊通络法对多发性硬化髓鞘再生适应性保护效应机制

国家自然科学基金

0+阅读 · 2013年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

油菜BnICE1基因与MAP激酶信号途径在调控植物耐寒性中的相互作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员