DexRepNet: Learning Dexterous Robotic Grasping Network with Geometric and Spatial Hand-Object Representations


March 17, 2023


Qingtao Liu, Yu Cui, Zhengnan Sun, Haoming Li, Gaofeng Li, Lin Shao, Jiming Chen, Qi Ye

From arXiv; IROS 2023 (under review)

Robotic dexterous grasping is challenging because multi-fingered robotic hands have many degrees of freedom (DoF) and make complex contacts. Existing deep reinforcement learning (DRL) methods leverage human demonstrations to reduce the sample complexity caused by the high-dimensional action space of dexterous grasping. However, less attention has been paid to hand-object interaction representations that support high-level generalization. In this paper, we propose a novel geometric and spatial hand-object interaction representation, named DexRep, to capture dynamic object shape features and the spatial relations between hands and objects during grasping. DexRep comprises three parts: an Occupancy Feature for the rough shape within the sensing range of the moving hand, a Surface Feature for the changing hand-object surface distances, and a Local-Geo Feature for the local geometric surface features most related to potential contacts. Based on this representation, we propose a dexterous deep reinforcement learning method to learn a generalizable grasping policy, DexRepNet. Experimental results show that our method substantially outperforms baselines using existing representations for robotic grasping in both grasp success rate and convergence speed. It achieves a 93% grasping success rate on seen objects and success rates above 80% on diverse objects of unseen categories in both simulation and real-world experiments.
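To make the three DexRep components more concrete, the following is a minimal NumPy sketch of how such features might be computed from point sets. It is not the authors' implementation: the function names, the grid size and cell width, the use of hand keypoints, and the use of surface normals as a stand-in for the Local-Geo Feature are all assumptions made for illustration only.

```python
import numpy as np

def occupancy_feature(hand_center, object_points, grid_size=4, cell=0.02):
    """Coarse binary occupancy of a cubic grid centred on the hand.

    grid_size and cell (metres) are hypothetical values, not from the paper.
    """
    half = grid_size * cell / 2.0
    rel = object_points - hand_center              # object points in a hand-centred frame
    inside = np.all(np.abs(rel) < half, axis=1)    # keep points within the sensing cube
    idx = np.floor((rel[inside] + half) / cell).astype(int)
    idx = np.clip(idx, 0, grid_size - 1)
    grid = np.zeros((grid_size,) * 3, dtype=np.float32)
    grid[idx[:, 0], idx[:, 1], idx[:, 2]] = 1.0    # mark occupied cells
    return grid.ravel()

def surface_feature(hand_points, object_points):
    """Distance from each hand keypoint to its nearest object surface point."""
    diff = hand_points[:, None, :] - object_points[None, :, :]
    dists = np.linalg.norm(diff, axis=2)           # shape: (n_hand, n_object)
    return dists.min(axis=1), dists.argmin(axis=1)

def local_geo_feature(nearest_idx, object_normals):
    """Stand-in local geometry: surface normals at the nearest object points."""
    return object_normals[nearest_idx].ravel()

def dexrep_sketch(hand_center, hand_points, object_points, object_normals):
    """Concatenate the three illustrative features into one observation vector."""
    occ = occupancy_feature(hand_center, object_points)
    dist, nearest = surface_feature(hand_points, object_points)
    geo = local_geo_feature(nearest, object_normals)
    return np.concatenate([occ, dist, geo])
```

In a DRL setup, a vector like the one returned by the hypothetical `dexrep_sketch` would be recomputed at every step as the hand moves and passed to the policy as part of its observation.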


