机器人布料操作中的顺序最优化准静态和动态操作基元学习：QDP (QDP: Learning to Sequentially Optimise Quasi-Static and Dynamic Manipulation Primitives for Robotic Cloth Manipulation) - 专知论文

会员服务 ·

0

Performer · Learning · 机器人 · HTTPS · MASS ·

2023 年 3 月 23 日

QDP: Learning to Sequentially Optimise Quasi-Static and Dynamic Manipulation Primitives for Robotic Cloth Manipulation

翻译：机器人布料操作中的顺序最优化准静态和动态操作基元学习：QDP

David Blanco-Mulero,Gokhan Alcan,Fares J. Abu-Dakka,Ville Kyrki

from arxiv, 8 pages, 7 figures. Supplementary material available at https://sites.google.com/view/qdp-srl

Pre-defined manipulation primitives are widely used for cloth manipulation. However, cloth properties such as its stiffness or density can highly impact the performance of these primitives. Although existing solutions have tackled the parameterisation of pick and place locations, the effect of factors such as the velocity or trajectory of quasi-static and dynamic manipulation primitives has been neglected. Choosing appropriate values for these parameters is crucial to cope with the range of materials present in house-hold cloth objects. To address this challenge, we introduce the Quasi-Dynamic Parameterisable (QDP) method, which optimises parameters such as the motion velocity in addition to the pick and place positions of quasi-static and dynamic manipulation primitives. In this work, we leverage the framework of Sequential Reinforcement Learning to decouple sequentially the parameters that compose the primitives. To evaluate the effectiveness of the method we focus on the task of cloth unfolding with a robotic arm in simulation and real-world experiments. Our results in simulation show that by deciding the optimal parameters for the primitives the performance can improve by 20% compared to sub-optimal ones. Real-world results demonstrate the advantage of modifying the velocity and height of manipulation primitives for cloths with different mass, stiffness, shape and size. Supplementary material, videos, and code, can be found at https://sites.google.com/view/qdp-srl.

翻译：预定义的操作基元广泛用于布料操作。然而，布料的刚度或密度等性质可以高度影响这些基元的性能。尽管现有解决方案已经解决了拾放位置的参数化问题，但准静态和动态操作基元的速度或轨迹等因素的影响被忽略了。选择适当的参数值是应对家用布料中存在的材料范围的关键。为了解决这一挑战，我们介绍了准动态可参数化（QDP）方法，该方法优化参数，例如准静态和动态操作基元的运动速度以及拾放位置。在这项工作中，我们利用连续强化学习的框架，将组成基元的参数分解为顺序子问题。为了评估该方法的有效性，我们专注于模拟和实际实验中的布料展开任务。我们在模拟中的结果表明，通过决定基元的最优参数，性能可以提高20％，与次优参数相比。实际结果表明，修改不同质量、刚度、形状和大小的布料的操作基元速度和高度具有优势。补充材料、视频和代码可在https://sites.google.com/view/qdp-srl找到。

0

相关内容

Performer

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

专知会员服务

37+阅读 · 2023年4月17日

视频自监督学习综述

视频自监督学习综述

专知会员服务

53+阅读 · 2022年7月5日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

近期必读的5篇顶会CVPR 2021【行为识别】相关论文和代码

专知会员服务

60+阅读 · 2021年3月17日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

【CMU博士论文】机器人深度强化学习，128页pdf

【CMU博士论文】机器人深度强化学习，128页pdf

专知会员服务

133+阅读 · 2020年8月27日

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

专知会员服务

25+阅读 · 2020年2月28日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

17种深度强化学习算法用Pytorch实现

17种深度强化学习算法用Pytorch实现

新智元

31+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【泡泡一分钟】优化对比度增强以提高SLAM重定位环境中视觉跟踪的稳健性

【泡泡一分钟】优化对比度增强以提高SLAM重定位环境中视觉跟踪的稳健性

泡泡机器人SLAM

10+阅读 · 2019年4月26日

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

泡泡机器人SLAM

23+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】基于李群的无损卡尔曼滤波器在视觉里程计上的应用

【泡泡一分钟】基于李群的无损卡尔曼滤波器在视觉里程计上的应用

泡泡机器人SLAM

11+阅读 · 2018年12月17日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【泡泡一分钟】动态环境下稳健的单目SLAM

【泡泡一分钟】动态环境下稳健的单目SLAM

泡泡机器人SLAM

13+阅读 · 2018年3月22日

基于虚拟螺旋运动坐标系的捷联速度算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

流体中形状优化问题的高可扩展并行区域分解算法

国家自然科学基金

1+阅读 · 2013年12月31日

椭球上基于几何扩张的欠驱动运动体协同路径跟踪控制

国家自然科学基金

0+阅读 · 2012年12月31日

基于光致变形微纳夹持器的三维纳米操作机器人研究

国家自然科学基金

0+阅读 · 2012年12月31日

功能梯度材料对称结构的静动力学问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

基于无酶循环放大策略的多通道纳米电化学生物传感器用于肝癌多元肿瘤标志物的联合检测

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程的三角形cut-cell自适应有限元方法

国家自然科学基金

0+阅读 · 2011年12月31日

有约束的航天器高精度快速姿态跟踪动力学与控制特性研究

国家自然科学基金

1+阅读 · 2011年12月31日

独立操作型可重构机器人群体矢量构形研究

国家自然科学基金

0+阅读 · 2009年12月31日

Double-Iterative Gaussian Process Regression for Modeling Error Compensation in Autonomous Racing

Arxiv

0+阅读 · 2023年5月12日

Motion Macro Programming on Assistive Robotic Manipulators: Three Skill Types for Everyday Tasks

Arxiv

0+阅读 · 2023年5月12日

Initial Steps Towards Tackling High-dimensional Surrogate Modeling for Neuroevolution Using Kriging Partial Least Squares

Arxiv

0+阅读 · 2023年5月11日

NUBO: A Transparent Python Package for Bayesian Optimisation

Arxiv

0+阅读 · 2023年5月11日

Causal Policy Gradient for Whole-Body Mobile Manipulation

Arxiv

0+阅读 · 2023年5月11日

Concentric Tube Robot Redundancy Resolution via Velocity/Compliance Manipulability Optimization

Arxiv

0+阅读 · 2023年5月10日

Motion Planning for Autonomous Driving: The State of the Art and Future Perspectives

Arxiv

0+阅读 · 2023年5月10日

Achieving Diversity in Counterfactual Explanations: a Review and Discussion

Arxiv

0+阅读 · 2023年5月10日

Information Design in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年5月8日

Automated Graph Machine Learning: Approaches, Libraries and Directions

Arxiv

20+阅读 · 2022年1月4日

VIP会员

文章信息

相关主题

相关VIP内容

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

专知会员服务

37+阅读 · 2023年4月17日

视频自监督学习综述

视频自监督学习综述

专知会员服务

53+阅读 · 2022年7月5日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

近期必读的5篇顶会CVPR 2021【行为识别】相关论文和代码

专知会员服务

60+阅读 · 2021年3月17日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

【CMU博士论文】机器人深度强化学习，128页pdf

【CMU博士论文】机器人深度强化学习，128页pdf

专知会员服务

133+阅读 · 2020年8月27日

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

专知会员服务

25+阅读 · 2020年2月28日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

17种深度强化学习算法用Pytorch实现

17种深度强化学习算法用Pytorch实现

新智元

31+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【泡泡一分钟】优化对比度增强以提高SLAM重定位环境中视觉跟踪的稳健性

【泡泡一分钟】优化对比度增强以提高SLAM重定位环境中视觉跟踪的稳健性

泡泡机器人SLAM

10+阅读 · 2019年4月26日

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

泡泡机器人SLAM

23+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】基于李群的无损卡尔曼滤波器在视觉里程计上的应用

【泡泡一分钟】基于李群的无损卡尔曼滤波器在视觉里程计上的应用

泡泡机器人SLAM

11+阅读 · 2018年12月17日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【泡泡一分钟】动态环境下稳健的单目SLAM

【泡泡一分钟】动态环境下稳健的单目SLAM

泡泡机器人SLAM

13+阅读 · 2018年3月22日

相关论文

Double-Iterative Gaussian Process Regression for Modeling Error Compensation in Autonomous Racing

Arxiv

0+阅读 · 2023年5月12日

Motion Macro Programming on Assistive Robotic Manipulators: Three Skill Types for Everyday Tasks

Arxiv

0+阅读 · 2023年5月12日

Initial Steps Towards Tackling High-dimensional Surrogate Modeling for Neuroevolution Using Kriging Partial Least Squares

Arxiv

0+阅读 · 2023年5月11日

NUBO: A Transparent Python Package for Bayesian Optimisation

Arxiv

0+阅读 · 2023年5月11日

Causal Policy Gradient for Whole-Body Mobile Manipulation

Arxiv

0+阅读 · 2023年5月11日

Concentric Tube Robot Redundancy Resolution via Velocity/Compliance Manipulability Optimization

Arxiv

0+阅读 · 2023年5月10日

Motion Planning for Autonomous Driving: The State of the Art and Future Perspectives

Arxiv

0+阅读 · 2023年5月10日

Achieving Diversity in Counterfactual Explanations: a Review and Discussion

Arxiv

0+阅读 · 2023年5月10日

Information Design in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年5月8日

Automated Graph Machine Learning: Approaches, Libraries and Directions

Arxiv

20+阅读 · 2022年1月4日

相关基金

基于虚拟螺旋运动坐标系的捷联速度算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

流体中形状优化问题的高可扩展并行区域分解算法

国家自然科学基金

1+阅读 · 2013年12月31日

椭球上基于几何扩张的欠驱动运动体协同路径跟踪控制

国家自然科学基金

0+阅读 · 2012年12月31日

基于光致变形微纳夹持器的三维纳米操作机器人研究

国家自然科学基金

0+阅读 · 2012年12月31日

功能梯度材料对称结构的静动力学问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

基于无酶循环放大策略的多通道纳米电化学生物传感器用于肝癌多元肿瘤标志物的联合检测

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程的三角形cut-cell自适应有限元方法

国家自然科学基金

0+阅读 · 2011年12月31日

有约束的航天器高精度快速姿态跟踪动力学与控制特性研究

国家自然科学基金

1+阅读 · 2011年12月31日

独立操作型可重构机器人群体矢量构形研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员