PartManip: 从点云观察中学习跨类别通用部件操纵策略 (PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations) - 专知论文

会员服务 ·

0

部件 · 类别 · 点云 · 操作 · 泛化 ·

2023 年 3 月 29 日

PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations

翻译：PartManip: 从点云观察中学习跨类别通用部件操纵策略

Haoran Geng,Ziming Li,Yiran Geng,Jiayi Chen,Hao Dong,He Wang

from arxiv, Accepted by CVPR2023

Learning a generalizable object manipulation policy is vital for an embodied agent to work in complex real-world scenes. Parts, as the shared components in different object categories, have the potential to increase the generalization ability of the manipulation policy and achieve cross-category object manipulation. In this work, we build the first large-scale, part-based cross-category object manipulation benchmark, PartManip, which is composed of 11 object categories, 494 objects, and 1432 tasks in 6 task classes. Compared to previous work, our benchmark is also more diverse and realistic, i.e., having more objects and using sparse-view point cloud as input without oracle information like part segmentation. To tackle the difficulties of vision-based policy learning, we first train a state-based expert with our proposed part-based canonicalization and part-aware rewards, and then distill the knowledge to a vision-based student. We also find an expressive backbone is essential to overcome the large diversity of different objects. For cross-category generalization, we introduce domain adversarial learning for domain-invariant feature extraction. Extensive experiments in simulation show that our learned policy can outperform other methods by a large margin, especially on unseen object categories. We also demonstrate our method can successfully manipulate novel objects in the real world.

翻译：学习通用的对象操作策略对于一个具有实体代理的实体在复杂的现实场景中发挥作用非常关键。部件作为不同对象类别的共享组件，有潜力增加操作策略的泛化能力，并实现跨类别的对象操作。在这项工作中，我们建立了第一个大规模的基于部件的跨类别对象操作基准（PartManip），它由 11 个对象类别、494 个对象和 6 个任务类别中的 1432 个任务组成。相比之前的工作，我们的基准还更加多样化和真实，即具有更多的对象并使用稀疏视图点云作为输入，而不需要像部件分割这样的神谕信息。为了解决基于视觉的策略学习的困难，我们首先使用我们提出的基于部件的规范化和部件感知的奖励训练一个基于状态的专家，然后将知识提炼到一个基于视觉的学生中。我们还发现，表达丰富的骨干网络对于克服不同对象的大型多样性至关重要。为了实现跨类别泛化，我们引入了领域对抗学习进行域不变特征提取。在模拟实验中进行的广泛实验证明，我们学习到的策略可以在很大程度上优于其他方法，特别是在未见过的对象类别上。我们还展示了我们的方法可以成功地操作现实世界中的新颖对象。

0

相关内容

【伯克利博士论文】机器人机械搜索的操作与感知策略

【伯克利博士论文】机器人机械搜索的操作与感知策略

专知会员服务

16+阅读 · 2022年6月4日

【布朗大学David Abel博士论文】A Theory of Abstraction in Reinforcement Learning

【布朗大学David Abel博士论文】A Theory of Abstraction in Reinforcement Learning

专知会员服务

25+阅读 · 2022年3月16日

【CVPR2021】CVPR2021 | MotionRNN：针对复杂时空运动的通用视频预测模型

专知会员服务

14+阅读 · 2021年4月22日

【CVPR2021】面向开放世界的目标检测

专知会员服务

27+阅读 · 2021年3月5日

【牛津大学BoYang博士论文】学习重建和分割三维物体，143页pdf

【牛津大学BoYang博士论文】学习重建和分割三维物体，143页pdf

专知会员服务

67+阅读 · 2020年11月9日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

专知会员服务

48+阅读 · 2020年4月13日

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

专知会员服务

38+阅读 · 2020年3月23日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

【AAAI2020论文-腾讯】通过稠密边界发生器快速学习时间动作方案（Fast Learning of Temporal Action Proposal via Dense Boundary Generator）

【AAAI2020论文-腾讯】通过稠密边界发生器快速学习时间动作方案（Fast Learning of Temporal Action Proposal via Dense Boundary Generator）

专知会员服务

12+阅读 · 2019年11月15日

【三星AI-CVPR2020】增量小样本目标检测，Incremental Few-Shot Object Detection

【三星AI-CVPR2020】增量小样本目标检测，Incremental Few-Shot Object Detection

专知

55+阅读 · 2020年3月11日

【泡泡点云时空】SqueezeSegV2：改进模型结构和无监督领域自适应的激光雷达点云道路目标分割方法

【泡泡点云时空】SqueezeSegV2：改进模型结构和无监督领域自适应的激光雷达点云道路目标分割方法

泡泡机器人SLAM

11+阅读 · 2019年9月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Zero-Shot Learning相关资源大列表

Zero-Shot Learning相关资源大列表

专知

52+阅读 · 2019年1月1日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

【泡泡点云时空】用于点云识别的注意力形状上下文网络（CVPR2018-1）

【泡泡点云时空】用于点云识别的注意力形状上下文网络（CVPR2018-1）

泡泡机器人SLAM

33+阅读 · 2018年8月6日

【泡泡一分钟】神经SLAM：使用外部存储器让智能体学习探索环境

【泡泡一分钟】神经SLAM：使用外部存储器让智能体学习探索环境

泡泡机器人SLAM

12+阅读 · 2018年4月17日

论文 | YOLO（You Only Look Once）目标检测

论文 | YOLO（You Only Look Once）目标检测

七月在线实验室

14+阅读 · 2017年12月12日

基于环境异质信息的机器觉察与仿生知觉方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于机器学习的室外未知环境中移动机器人定位研究

国家自然科学基金

4+阅读 · 2014年12月31日

图像细粒度识别的显著性特征学习算法研究

国家自然科学基金

2+阅读 · 2014年12月31日

基于目标导向MI-EEGas的上肢运动康复方法双向适应性研究

国家自然科学基金

0+阅读 · 2014年12月31日

行人检测中粒度空间特征提取方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

现实场景的多传感采样与三维重建方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

迁移学习在图像分类中的应用研究

国家自然科学基金

8+阅读 · 2013年12月31日

动态环境下基于概率图模型的机器人地点识别及实时语义地图构建新方法

国家自然科学基金

0+阅读 · 2012年12月31日

Wnt/β-catenin信号通路在TGF-β1诱导的真皮成纤维细胞向肌成纤维细胞表型转化中的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于运动模式在线学习的移动机器人对运动目标的主动观测与最优跟踪

国家自然科学基金

0+阅读 · 2011年12月31日

Multi-spectral Class Center Network for Face Manipulation Detection and Localization

Arxiv

0+阅读 · 2023年5月18日

Deep Class-Incremental Learning: A Survey

Arxiv

13+阅读 · 2023年2月7日

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

Arxiv

42+阅读 · 2022年6月15日

A Survey of Deep Learning for Low-Shot Object Detection

Arxiv

21+阅读 · 2021年12月6日

Multimodality in Meta-Learning: A Comprehensive Survey

Arxiv

37+阅读 · 2021年9月28日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

Image Manipulation Detection by Multi-View Multi-Scale Supervision

Arxiv

13+阅读 · 2021年7月25日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

Arxiv

100+阅读 · 2020年2月20日

VIP会员

文章信息

相关主题

相关VIP内容

【伯克利博士论文】机器人机械搜索的操作与感知策略

【伯克利博士论文】机器人机械搜索的操作与感知策略

专知会员服务

16+阅读 · 2022年6月4日

【布朗大学David Abel博士论文】A Theory of Abstraction in Reinforcement Learning

【布朗大学David Abel博士论文】A Theory of Abstraction in Reinforcement Learning

专知会员服务

25+阅读 · 2022年3月16日

【CVPR2021】CVPR2021 | MotionRNN：针对复杂时空运动的通用视频预测模型

专知会员服务

14+阅读 · 2021年4月22日

【CVPR2021】面向开放世界的目标检测

专知会员服务

27+阅读 · 2021年3月5日

【牛津大学BoYang博士论文】学习重建和分割三维物体，143页pdf

【牛津大学BoYang博士论文】学习重建和分割三维物体，143页pdf

专知会员服务

67+阅读 · 2020年11月9日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

专知会员服务

48+阅读 · 2020年4月13日

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

专知会员服务

38+阅读 · 2020年3月23日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

【AAAI2020论文-腾讯】通过稠密边界发生器快速学习时间动作方案（Fast Learning of Temporal Action Proposal via Dense Boundary Generator）

【AAAI2020论文-腾讯】通过稠密边界发生器快速学习时间动作方案（Fast Learning of Temporal Action Proposal via Dense Boundary Generator）

专知会员服务

12+阅读 · 2019年11月15日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

【三星AI-CVPR2020】增量小样本目标检测，Incremental Few-Shot Object Detection

【三星AI-CVPR2020】增量小样本目标检测，Incremental Few-Shot Object Detection

专知

55+阅读 · 2020年3月11日

【泡泡点云时空】SqueezeSegV2：改进模型结构和无监督领域自适应的激光雷达点云道路目标分割方法

【泡泡点云时空】SqueezeSegV2：改进模型结构和无监督领域自适应的激光雷达点云道路目标分割方法

泡泡机器人SLAM

11+阅读 · 2019年9月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Zero-Shot Learning相关资源大列表

Zero-Shot Learning相关资源大列表

专知

52+阅读 · 2019年1月1日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

【泡泡点云时空】用于点云识别的注意力形状上下文网络（CVPR2018-1）

【泡泡点云时空】用于点云识别的注意力形状上下文网络（CVPR2018-1）

泡泡机器人SLAM

33+阅读 · 2018年8月6日

【泡泡一分钟】神经SLAM：使用外部存储器让智能体学习探索环境

【泡泡一分钟】神经SLAM：使用外部存储器让智能体学习探索环境

泡泡机器人SLAM

12+阅读 · 2018年4月17日

论文 | YOLO（You Only Look Once）目标检测

论文 | YOLO（You Only Look Once）目标检测

七月在线实验室

14+阅读 · 2017年12月12日

相关论文

Multi-spectral Class Center Network for Face Manipulation Detection and Localization

Arxiv

0+阅读 · 2023年5月18日

Deep Class-Incremental Learning: A Survey

Arxiv

13+阅读 · 2023年2月7日

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

Arxiv

42+阅读 · 2022年6月15日

A Survey of Deep Learning for Low-Shot Object Detection

Arxiv

21+阅读 · 2021年12月6日

Multimodality in Meta-Learning: A Comprehensive Survey

Arxiv

37+阅读 · 2021年9月28日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

Image Manipulation Detection by Multi-View Multi-Scale Supervision

Arxiv

13+阅读 · 2021年7月25日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

A survey on Semi-, Self- and Unsupervised Techniques in Image Classification

Arxiv

100+阅读 · 2020年2月20日

相关基金

基于环境异质信息的机器觉察与仿生知觉方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于机器学习的室外未知环境中移动机器人定位研究

国家自然科学基金

4+阅读 · 2014年12月31日

图像细粒度识别的显著性特征学习算法研究

国家自然科学基金

2+阅读 · 2014年12月31日

基于目标导向MI-EEGas的上肢运动康复方法双向适应性研究

国家自然科学基金

0+阅读 · 2014年12月31日

行人检测中粒度空间特征提取方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

现实场景的多传感采样与三维重建方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

迁移学习在图像分类中的应用研究

国家自然科学基金

8+阅读 · 2013年12月31日

动态环境下基于概率图模型的机器人地点识别及实时语义地图构建新方法

国家自然科学基金

0+阅读 · 2012年12月31日

Wnt/β-catenin信号通路在TGF-β1诱导的真皮成纤维细胞向肌成纤维细胞表型转化中的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于运动模式在线学习的移动机器人对运动目标的主动观测与最优跟踪

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员