Uni6Dv3: 5D 6D粒子估计5D Anchor机制 (Uni6Dv3: 5D Anchor Mechanism for 6D Pose Estimation) - 专知论文

会员服务 ·

0

anchor · 估计/估计量 · 有向 · 偏移量 · 3D ·

2022 年 10 月 21 日

Uni6Dv3: 5D Anchor Mechanism for 6D Pose Estimation

翻译：Uni6Dv3: 5D 6D粒子估计5D Anchor机制

Jianqiu Chen,Mingshan Sun,Ye Zheng,Tianpeng Bao,Zhenyu He,Donghai Li,Guoqiang Jin,Rui Zhao,Liwei Wu,Xiaoke Jiang

Unlike indirect methods that usually require time-consuming post-processing, recent deep learning-based direct methods for 6D pose estimation try to predict the 3D rotation and 3D translation from RGB-D data directly. However, direct methods, regressing the absolute translation of the pose, suffer from diverse object translation distribution between training and test data, which is usually caused by expensive data collection and annotation in practice. To this end, we propose a 5D anchor mechanism by defining the anchor with 3D coordinates in the physical space and 2D coordinates in the image plane. Inspired by anchor-based object detection methods, 5D anchor regresses the offset between the target and anchor, which eliminates the distribution gap and transforms the regression target to a small range. But regressing offset leads to the mismatch between the absolute input and relative output. We build an anchor-based projection model by replacing the absolute input with the relative one, which further improves the performance. By plugging 5D anchor into the latest direct methods, Uni6Dv2 and ES6D obtain 38.7% and 3.5% improvement, respectively. Specifically, Uni6Dv2+5D anchor, dubbed Uni6Dv3, achieves state-of-the-art overall results on datasets including Occlusion LineMOD (79.3%), LineMOD (99.5%), and YCB-Video datasets (91.5%), and requires only 10% of training data to reach comparable performance as full data.

翻译：与通常需要耗时后处理的间接方法不同,最近对 6D 进行基于深深学习的直接方法的估算试图直接预测 RGB-D 数据中的 3D 旋转和 3D 翻译。但是, 直接方法, 使图像的绝对翻转倒退, 受培训和测试数据之间不同对象翻译分布的影响, 通常是昂贵的数据收集和实践中的注解造成的。为此, 我们提议了一个 5D 锁定机制, 其方法是在物理空间和图像平面的 2D 坐标中用 3D 定位定位点定义3D 坐标。在基于锚的物体探测方法的启发下, 5D 锚将目标与锁定之间的抵消, 从而消除分布差距, 并将回归目标转换到小范围。但是, 递增抵消导致绝对输入和相对输出之间的不匹配。我们建立一个基于锁定的预测模型, 将绝对输入替换为相对输入, 进一步改进性能。通过将 5D 锁定最新直接方法, Uni6Dv2 和ES6D 获得38.7% 和3.5% 改进。具体而言, UI6D+D+D 整个数据定义, 要求实现整个数据- D 包括% CLED IMOD 。

1

相关内容

anchor

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

冷冻空间软硬吸合面间湿空气呼吸效应机制

国家自然科学基金

0+阅读 · 2015年12月31日

IGFBP7对脓毒症小鼠肾小管上皮细胞分裂周期的影响及其机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

内质网应激IRE1－XBP1S通路在高糖引起肾脏及系膜细胞发生氧化应激及损伤中的机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于分形几何理论的公路路面初生裂纹辨识策略与定量评价机理

国家自然科学基金

0+阅读 · 2014年12月31日

含碳气溶胶光谱特性研究

国家自然科学基金

0+阅读 · 2013年12月31日

综合污染物迁移机制和空间统计模型的土壤有机污染物空间分布预测

国家自然科学基金

0+阅读 · 2013年12月31日

地表温度-植被覆盖度特征空间蒸散发遥感反演的空间尺度效应及干湿边确定方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

符合正电子湮没技术研究ZnO压敏电阻中缺陷和3d电子

国家自然科学基金

0+阅读 · 2012年12月31日

A位稀土离子磁性对E-型反铁磁序锰氧化物多铁性的影响

国家自然科学基金

0+阅读 · 2012年12月31日

关联体系中电荷有序的原位同步辐射表征

国家自然科学基金

0+阅读 · 2011年12月31日

2D Human Pose Estimation with Explicit Anatomical Keypoints Structure Constraints

Arxiv

0+阅读 · 2022年12月5日

ObjectMatch: Robust Registration using Canonical Object Correspondences

Arxiv

0+阅读 · 2022年12月5日

A dataset for audio-video based vehicle speed estimation

Arxiv

0+阅读 · 2022年12月3日

FECAM: Frequency Enhanced Channel Attention Mechanism for Time Series Forecasting

Arxiv

0+阅读 · 2022年12月2日

RGB-D based Stair Detection using Deep Learning for Autonomous Stair Climbing

Arxiv

0+阅读 · 2022年12月2日

Leveraging Single-View Images for Unsupervised 3D Point Cloud Completion

Arxiv

1+阅读 · 2022年12月1日

Res6D: Projective Residual Regression for 6D Pose Estimation

Arxiv

1+阅读 · 2022年12月1日

Decomposed Mutual Information Estimation for Contrastive Representation Learning

Arxiv

11+阅读 · 2021年6月25日

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Arxiv

19+阅读 · 2020年3月31日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

《理解城市战及其在俄乌战争中的表现》报告

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

《建设式兵棋模拟作为战术集群配置优化的关键组成部分》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

2D Human Pose Estimation with Explicit Anatomical Keypoints Structure Constraints

Arxiv

0+阅读 · 2022年12月5日

ObjectMatch: Robust Registration using Canonical Object Correspondences

Arxiv

0+阅读 · 2022年12月5日

A dataset for audio-video based vehicle speed estimation

Arxiv

0+阅读 · 2022年12月3日

FECAM: Frequency Enhanced Channel Attention Mechanism for Time Series Forecasting

Arxiv

0+阅读 · 2022年12月2日

RGB-D based Stair Detection using Deep Learning for Autonomous Stair Climbing

Arxiv

0+阅读 · 2022年12月2日

Leveraging Single-View Images for Unsupervised 3D Point Cloud Completion

Arxiv

1+阅读 · 2022年12月1日

Res6D: Projective Residual Regression for 6D Pose Estimation

Arxiv

1+阅读 · 2022年12月1日

Decomposed Mutual Information Estimation for Contrastive Representation Learning

Arxiv

11+阅读 · 2021年6月25日

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Arxiv

19+阅读 · 2020年3月31日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

相关基金

冷冻空间软硬吸合面间湿空气呼吸效应机制

国家自然科学基金

0+阅读 · 2015年12月31日

IGFBP7对脓毒症小鼠肾小管上皮细胞分裂周期的影响及其机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

内质网应激IRE1－XBP1S通路在高糖引起肾脏及系膜细胞发生氧化应激及损伤中的机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于分形几何理论的公路路面初生裂纹辨识策略与定量评价机理

国家自然科学基金

0+阅读 · 2014年12月31日

含碳气溶胶光谱特性研究

国家自然科学基金

0+阅读 · 2013年12月31日

综合污染物迁移机制和空间统计模型的土壤有机污染物空间分布预测

国家自然科学基金

0+阅读 · 2013年12月31日

地表温度-植被覆盖度特征空间蒸散发遥感反演的空间尺度效应及干湿边确定方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

符合正电子湮没技术研究ZnO压敏电阻中缺陷和3d电子

国家自然科学基金

0+阅读 · 2012年12月31日

A位稀土离子磁性对E-型反铁磁序锰氧化物多铁性的影响

国家自然科学基金

0+阅读 · 2012年12月31日

关联体系中电荷有序的原位同步辐射表征

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员